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Introduction. 


Our  research  efforts  are  concerned  with  undetermined  coefficient  problems  in  partial  differen¬ 
tial  equations,  in  particular  those  problems  where  the  unknown  coefficients  depend  only  on  the 
dependent  variables.  The  problems  modeled  by  these  equations  are  related  to  the  determination  of 
unknown  physical  laws  or  relationships.  The  nonlinear  terms  which  we  seek  to  recover  in  our  model 
problems  correspond  to  material  properties  that  have  physical  significance;  temperature  dependent 
specific  heats,  conductivities,  reaction  terms,  to  name  a  few.  Examples  of  such  problems  are  - 
the  determination  of  an  unknown  reaction  term  /(•)  in  ut  -  uxx  =  f(u) ,  or  the  conductivity  k(-) 
in  the  equation  V.&( u)Vu  =  0.  We  seek  to  determine  these  functions  by  giving  only  overposed 
boundary  data. 

This  type  of  problem  is  distinct  from  those  that  involve  media  with  unknown  inhomogeneities; 
that  is,  the  differential  equations  contain  an  unknown  coefficient  that  depends  on  the  independent 
spatial  variable.  In  a  given  physical  problem  both  situations  may  occur,  that  is,  the  unknowns  have 
both  spatial  as  well  temperature  dependence.  This  is  a  considerably  more  difficult  problem  and 
has  received  little  attention  in  its  full  generality;  instead  both  the  limiting  cases  of  dependence  on  a 
single  one  of  these  variables  have  received  the  vast  majority  of  the  research  efforts.  For  those  cases 
where  the  media  is  isotropic,  the  assumption  that  the  unknowns  depend  only  on  the  independent 
variable  may  be  a  very  reasonable  one.  From  a  mathematical  standpoint  the  two  types  of  problem 
pose  different  mathematical  difficulties.  Both  types  of  inverse  problem  leads  to  nonlinear  equations, 
but  in  the  case  when  the  unknown  coefficient  depends  on  the  spatial  variables  the  direct  problem 
is  frequently  linear.  In  contrast,  when  the  coefficient  is  a  function  of  the  dependent  variable,  the 
direct  problem  is  nonlinear. 

Examples: 

(I)  The  recovery  of  the  one  (or  more  than  one)  of  the  functions  c,  ifc,  /  or  h  from  the  parabolic  equation 

c(ti)u(  -  V  .  fc(u)Vu  =  /(u)  +  y(x,t)  infix[0,r]  (1) 

with  initial  data  u(x,  0)  =  uo  ,  and  boundary  data 

^  =  h(u)  +  a(z,  t)  x  €  dSi  (2) 

We  prescribe  as  overposed  data  the  value  of  u  at  a  specified  point  xo  on  d£l .  Of  course,  if  more  then 
one  function  is  unknown  additional  overposed  data  must  be  prescribed.  This  can  be  either  the  value 
of  u  at  other  boundary  points,  or  the  value  of  u  at  the  same  point  for  different  experiments,  that  is, 
for  values  of  the  known  functions  y ,  a  or  uo .  The  functions  c ,  fc ,  /  and  h  represent  the  specific 
heat,  conductivity,  reaction  term,  and  radiation  boundary  condition  respectively.  An  alternative  type 
of  overposed  condition  would  be  to  prescribe  the  total  energy  of  the  system  as  a  function  of  time.  This 
leads  to  recovery  problems  with  non-local  boundary  conditions. 

(II)  The  recovery  of  one  of  the  functions  k  or  /  in 

-V .  fc(u)Vu  =  /(u)  +  y(x,  y)  in  Q  (3) 

given  Neumann  data  on  dfi ,  plus  the  value  of  u(x)  =  9(x)  on  some  subset  of  the  boundary  as  overposed 
data.  Once  again,  if  both  k  and  /  are  unknown  then  we  should  prescribe  additional  overposed  data. 
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(Ill)  The  recovery  of  the  functions  f\{4>)  and  h(4>)  in  the  parabolic  system 


u«  -  D\  Au  =  arti  +  fi(4>) 

Vi  -  Dt  Av  =  f)v  + 

where  <j>  is  a  known  function  of  u  and  v  and  represents  the  interaction  between  these  functions.  A 
typical  example  might  be  <j>  =  uv . 

(IV)  If  it  is  known  that  the  coefficient  may  depend  on  the  gradient  of  u  as  well  u  itself, 

««-«*»= /(«,«»)  in  Q  x  [0, T]  (4) 

then  we  might  try  to  use  a  finite  expansion  of  the  function  in  the  variable  Vu,  and  look  for  an 
approximation  to  /  in  one  of  the  forms  /(u,  u,)  =  /i(u)+/j(u)u»  or  /( u,  u,)  =  {/i(u)+/j(u)«i}/{1+ 
/3(u)us}.  The  first  is  a  linear  approximation  in  the  variable  u, ,  the  second  a  rational  function 
approximation.  In  either  case  we  must  recover  a  vector  of  unknowns  (/i(u),  /2(u), . . .  ) 


Our  interest  in  these  problems  is  broad.  We  seek  to  identify  the  correct  types  of  data  under 
which  the  problem  has  a  solution.  A  uniqueness  result  indicates  sufficiency  of  data,  an  existence 
result  indicates  that  the  problem  is  not  overdetermined.  The  extent  to  which  the  solution  depends 
continuously  on  the  data  is  clearly  important  information  for  the  implementation  of  any  practical 
scheme.  In  addition  to  these  questions  we  are  interested  in  constructibility  of  the  solution  and  in 
the  development  of  algorithms  that  lead  to  efficient  numerical  recovery. 

Unknown  coefficient  problems  are  notorious  for  several  reasons;  they  tend  to  be  difficult  math¬ 
ematical  questions,  they  are  often  severely  ill-posed,  and  when  a  solution  is  found  it  tends  to  be 
ad  hoc.  While  not  much  can  be  done  about  the  first  two  of  these,  numerous  attempts  have  been 
made  to  resolve  the  third.  The  lack  of  success  in  this  endeavor,  even  for  seemingly  restricted 
cases  (parabolic  equations  with  a  single  unknown,  u  — dependent  coefficient)  is  well  known.  A 
major  theme  at  the  recent  Areata  conference*  was  the  discussion  of  various  attempts  at  providing 
“general”  solution  techniques  for  such  problems. 

Another  feature  of  these  inverse  problems,  due  in  part  to  their  intrinsic  nonlinearity,  is  that 
existence  of  a  solution  does  not  imply  uniqueness  or  vice-versa.  Indeed  there  are  many  problems 
for  which  uniqueness  has  been  shown  but  existence  questions  remain  completely  open.  Even  in 
those  cases  where  there  has  been  successes,  methods  yielding  existence  are  often  entirely  different 
form  those  that  gave  uniqueness,  and  the  time  span  between  the  proofs  of  uniqueness  and  that 
of  existence  can  be  considerable.  It  was  20  years  in  the  case  of  the  recovery  of  a  potential  in  an 
ordinary  differential  equation  from  spectral  data. 

The  research  proposed  and  our  current  endeavors,  focus  around  the  examining  of  several  ap¬ 
proaches  that  have  been  used  with  success  in  other  areas  and  attempt  to  apply  them  to  the  types 
of  problem  mentioned  in  the  examples  above.  Each  method  suggests  a  possible  algorithm  for  the 
inverse  problem,  our  main  interset  is  to  determine  those  that  are  feasible,  attempt  to  prove  conver¬ 
gence  of  the  method,  and  if  possible  compare  competing  methods  for  computational  efficiency. 

*  Inverse  problems  in  Partial  Differential  Equations  AMS/SIAM/IMS  Summer  Conference  Series,  Ar¬ 
eata  California,  July  26-August  4,  1989 
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Work  by  the  investigators  prior  to  the  grant  period. 

In  a  series  of  papers  the  investigators  have  developed  computational  algorithms  based  on  iteration 
techniques  which  are  applicable  to  a  wide  variety  of  problems  of  both  elliptic  and  parabolic  type. 
The  main  strategy  of  this  procedure  can  be  outlined  as  follows.  Using  an  initial  approximation 
to  the  unknown  function,  one  solves  the  direct  problem  to  obtain  the  solution  of  the  differential 
equation,  u(x,t).  Evaluating  the  differential  operator  on  the  overposed  boundary,  then  using  the 
solution  of  the  direct  problem  and  the  overposed  data,  one  obtains  a  nonlinear  equation  for  the 
unknown  coefficient  or  term  that  is  to  be  reconstructed.  Solving  this  nonlinear  equation,  a  new 
approximation  to  the  unknown  term  is  obtained.  This  procedure  is  repeated  until  convergence 
results.  Since  this  method  involves  projecting  the  differential  operator  onto  that  subset  of  the 
domain  where  the  overposed  data  is  given,  it  has  been  referred  to  as  the  Fixed  Point  Projection  or 
FPP  method.  The  terminology  “fixed  point”  refers  to  the  reformulation  of  the  inverse  problem  as 
an  equivalent  fixed  point  problem. 

As  an  illustration  of  the  technique,  consider  the  recovery  of  the  solution  pair  ( u ,  /)  in  ut  —  uxx  = 
f(u)  with  uc(0,  t)  =  go(t),  ux(l,t)  =  gi(t)  and  u(x,0)  =  «<>.  We  denote  the  solution  of  this  direct 
problem  by  u(x,t;f)  in  order  to  show  the  dependence  of  u  on  the  function  /.  Let  the  overposed 
data  be  u(0 ,  f)  =  0(t) .  If  the  differential  equation  is  evaluated  on  the  section  of  boundary  where 
the  overposed  data  is  prescribed  (x  =  0),  then  we  obtain  /(0(f))  =  9'{t)  —  uri(0,f;  /)  =  T[/] .  We 
can  show  that  the  inverse  problem  is  equivalent  to  proving  the  existence  of  a  unique  fixed  point  for 
T ,  and  give  conditions  on  the  data  for  which  such  a  fixed  point  exists.  The  solution  can  then  be 
obtained  by  the  iteration  scheme  /n+i  =  T[/n] . 

With  this  method  the  investigators  have  been  able  to  prove  uniqueness  and  convergence  results 
for  the  following  inverse  problems. 

(a)  The  recovery  of  the  solution  pair  (u,/)  in  u>  —  Au  =  /(u)  +7  given  the  initial  value  u(x,0)  and 
Neumann  data  on  81 ,  plus  the  value  of  u(xo ,t)  =  9(t),  for  some  point  xo  €  <9fl ,  as  overposed  data, 

[4] . 

(b)  The  recovery  of  the  solution  pair  (u,  /)  in  Au  =  /( u)  given  Neumann  data  on  81 ,  plus  the  value  of 
u(x)  =  9{x)  on  some  subset  of  the  boundary,  [3]. 

(c)  The  recovery  of  the  solution  pair  (u,  k)  in  u»  —  V  .  k( u)Vu  =  0  given  the  initial  value  u(x,  0)  and  the 
flux,  k(u)8u/8u  on  81,  plus  the  value  of  u(xo,f)  =  9(t),  xo  €  81  as  overposed  data,  [7]. 

(d)  The  recovery  of  the  solution  pair  (u,c)  in  c(u)u»  -  Au  =  7  given  the  initial  value  u(x,0),  Neumann 
data  on  81,  with  u(xo,f)  =  9(t),  xo  6  <9ft  forming  the  overposed  data,  [7]. 

(e)  The  recovery  of  the  solution  pair  (u,  h)  in  the  one  dimensional  heat  equation  subject  to  the  nonlinear 
boundary  conditions  ux  =  h(u)  on  81.  The  value  of  u(0,  t)  =  9(t),  is  given  as  the  overposed  data, 

[5] , 

Results  of  some  of  the  numerical  computations  can  be  found  in  [1]. 

In  addition  to  this  work  we  have  used  the  method  to  obtain  numerical  algorithms  for  the 
solution  of  the  multiple  unknown  coefficient  problems  in  items  (III)  and  (IV)  above  as  well  as  the 
recovery  of  both  c(u)  and  f(u)  in  c(u)ut-uxx  =  /(u),  [8].  These  problems  are  considerably  more 
complex,  and  to  date  we  have  not  been  able  to  answer  uniqueness  or  convergence  questions.  Some  of 
the  complications  include  the  differing  spaces  in  which  the  various  coefficients  lie,  and  in  obtaining 
physically  meaningful  conditions  on  the  data  that  allow  one  to  solve  the  update  equations.  We 
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have  been  able  to  provide  necessary  conditions  on  the  data,  and  to  demonstrate  the  feasibility  of 
the  approach  for  these  problems.  The  last  part  of  this  work  was  completed  under  the  present  ONR 
grant  period. 

For  certain  problems,  the  recovery  of  k(u)  in  V .  k(u)Vu  =  7  and  the  recovery  of  the  function 
h(u)  in  the  nonlinear  boundary  condition  ^  =  h(u)  for  the  heat  equation  in  n  spatial  dimensions, 
we  have  been  able  to  use  direct  approaches  and  obtain  uniqueness  results  under  quite  general 
assumptions  on  the  data,  [6,  13].  Neither  of  these  problems  appear  to  be  suitable  for  application 
of  the  FPP  method.  The  second  of  these  problems  was  done  during  the  ONR  grant  period. 

The  FPP  method  used  in  the  above  problems  has  the  advantage  that  it  is  applicable  to  a  wide 
class  of  undetermined  coefficient  problems  of  parabolic  and  elliptic  type,  and  is  easy  to  implement 
provided  that  the  boundary  conditions  are  in  a  particular  form  (specifically,  the  overposed  data 
should  consist  of  Dirichlet  data  on  some  subset  of  the  boundary).  In  addition,  as  long  as  the  data 
is  sufficiently  “noise-free,”  the  method  is  quite  robust.  Convergence  is  rapid,  and  takes  place  in  the 
optimal  space  for  the  problem,  that  is  the  space  with  the  maximum  regularity  permitted  by  the 
data. 

On  the  other  hand,  the  approach  has  its  drawbacks:  It  is  only  applicable  when  the  differential 
equation  (or  that  part  of  the  differential  operator  containing  the  unknown  coefficient)  is  evaluated  on 
the  overposed  boundary,  one  must  be  able  to  solve  the  resulting  equation  for  the  unknown  coefficient 
on  that  section  of  the  boundary  where  the  overposed  data  is  specified.  The  overposed  data  has  to 
be  monotone  on  that  section  of  the  boundary  where  it  is  specified,  and  since  implementation  of 
the  algorithm  always  requires  the  differentiation  of  the  overposed  data,  any  “noisy”  data  should  be 
smoothed  before  use  in  the  numerical  inverse  solver.  The  algorithm  is  sensitive  to  the  amount  and 
type  of  smoothing  employed.  Finally,  it  is  often  difficult  to  prove  the  convergence  of  the  algorithm. 

Overcoming  some  (or  all)  of  these  restrictions  is  the  primary  motivation  for  our  further  research 
on  these  problems.  There  are  alternative  methods,  some  of  which  have  been  used  to  solve  other 
types  of  inverse  problems,  but  to  our  knowledge  none  of  these  have  been  applied  to  the  recovery  of 
terms  containing  the  dependent  variable  in  quasilinear  equations. 


Research  under  the  grant  period. 

Our  current  research  program  consists  of  a  continuation  of  our  previous  work  in  parabolic  and 
elliptic  equations,  and  also  an  extension  of  these  ideas  and  techniques  to  undetermined  coefficient 
problems  for  first  order  hyperbolic  equations. 

For  the  elliptic/parabolic  problems  we  are  currently  investigating  several  approaches  -  Least 
Squares,  Conjugate  Gradient,  Newton  and  Quasi-Newton,  Homotopy,  and  Collocation  methods. 
These  methods  can  be  broadly  classified  as  either  residual  or  non-residual  based  recovery  schemes. 

We  will  once  again  use  our  one  spatial  dimension  reaction-diffusion  example  to  illustrate  the 
main  ideas.  The  inverse  problem  for  the  recovery  of  /  is  equivalent  to  solving  the  equation 
u(0 ,t\f)  =  9(t).  More  generally  we  can  write  the  overposed  data  as  f?[u]  =  9(t)  where  B  is 
some  functional  of  u.  The  case  B[u ]  =  u(0,  f)  corresponds  to  the  Dirichlet  condition  above,  and 
B[u }  =  fg  u(x,  t)dx  would  correspond  to  a  specification  of  energy.  Note  that  B  need  not  be  a  local 
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operator. 

The  residual  methods  are  iterative  schemes  that  have  the  general  form: 

/(n+l)  =  f(n)  _  [£>n]  ^ B[u(x  t ;f(n))  _  (5) 

where  Dn  is  linear  operator.  The  FPP  method  is  a  special  case  of  the  above,  [4],  as  are  the  Newton, 
Quasi-Newton  and  Homotopy  methods. 

If  one  knows  a  priori  information  about  /  (for  example  it  is  positive  or  monotone  or  very 
smooth),  then  one  may  be  able  to  incorporate  this  information  into  the  choice  of  D„  and  regularize 
an  otherwise  poorly  behaved  method.  Smoothing  splines,  moving  averages,  spline  interpolation, 
non-uniform  weight  methods  sire  some  operations  which  are  typically  incorporated  into  the  operator 
Dn .  Slight  changes  in  the  form  of  Dn  can  result  in  substantial  changes  in  the  performance  of  the 
algorithm. 

We  have  looked  at  general  residual  schemes  and  have  shown  that  one  of  them  is  equivalent  to 
the  FPP  method  for  the  recovery  of  /  in  in  the  reaction-diffusion  case  when  the  overspecified  data 
is  of  Dirichlet  type.  It  is  interesting  to  note  that  although  the  FPP  method  is  not  applicable  to 
other  boundary  conditions  such  as  the  prescription  of  total  energy  E(t)  =  ja  u(x,  t)dx ,  the  residual 
update  scheme  can  in  fact  be  used  in  this  case.  We  have  obtained  some  numerical  results  for  this 
modified  scheme  that  are  extremely  encouraging. 

An  example  of  a  non-residual  update  scheme  are  collocation  methods.  In  one  particular  imple¬ 
mentation,  we  use  the  evolutionary  nature  of  the  equations  to  sequentially  determine  the  coefficients 
cty  in  the  basis  expansion  /  =  by  solving  the  single  nonlinear  equation 

»=i+i 

0(tj+i)  =  B[u(x,tj‘,  ««&)] 

<=o 

for  a  monotone  sequence  of  points  {f;} .  We  assume  that  the  coefficients  {a,},  1  <  *  <  j,  have 
been  determined  for  {f,},  1  <  t  <  j .  This  leaves  a  single  unknown  arJ+i  to  be  determined  by  the 
remaining  equation. 

Another  example  of  a  non-residual  update  scheme  is  the  least  squares  method.  This  is  a 
popular  approach  especially  for  recovering  a  discrete  set  of  parameters.  Output  least  squares  has 
been  used  extensively  in  control  theory. 

A  survey  of  all  of  these  methods  as  they  apply  to  the  above  type  of  undetermined  coefficient 
problem  was  presented  by  the  investigators  in  the  proceedings  of  the  Areata  conference. 

In  this  last  half  year  we  have  concentrated  on  the  collocation  method,  and  applied  it  to  two 
particular  problems:  one  for  the  recovery  of  an  unknown  boundary  condition  in  the  heat  equation, 
and  the  other  for  an  interior  coefficient,  (forcing  term). 

For  the  first  problem  considerable  insight  has  been  obtained.  We  have  been  able  to  show  that 
there  is  a  unique  piecewise  linear  approximant  Jn  to  the  function  /  such  that  the  solution  of  the 
direct  problem  u(0,  t\  Jn)  agrees  with  the  overposed  data  at  each  of  the  N  collocation  points.  In 
addition,  we  have  given  conditions  under  which  the  function  fs  will  converge  to  the  function  /  in 
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the  supremum  norm,  and  proved  that  an  effective  rate  of  convergence  of  N~1!2  results.  This  work 
has  been  recently  submitted  [9],  and  details  on  the  numerical  implementation  of  the  algorithm 
as  well  as  its  application  to  boundary  control  will  appear  in  the  1989  Tampa  CDC  conference 
proceedings. 

For  the  recovery  of  the  forcing  term  f(u)  in  ut  —  Au  =  /( u)  we  have  been  able  to  show 
uniqueness  of  the  approximating  piecewise  linear  function  fa.  We  can  show  that  fa  cannot 
be  expected  to  converge  to  /  in  the  supremum  norm,  requiring  that  u(0 ,t]fa)  agrees  with  the 
overposed  data  at  the  N  collocation  points.  We  believe  that  one  can  obtain  convergence,  if  instead 
of  collocating  u(0,  t;  Jn)  -  0(t) ,  one  collocates  the  derivatives  of  these  quantities,  ut(0,  t;fa)~  6'(t) . 
This  situation  would  be  entirely  in  keeping  with  the  requirements  of  our  previous  results  using  the 
FPP  method,  but  we  have  not  as  yet  been  able  to  prove  a  positive  convergence  result  for  this  case. 
Numerical  experiments  bear  out  the  conjecture. 

In  light  of  the  success  of  the  collocation  method,  we  intend  to  investigate  the  interior  problem 
further,  and  also  to  use  the  method  for  resolving  other  inverse  problems.  Incremental  collocation 
schemes  have  several  attractions;  in  theory  they  are  widely  applicable  and  offer  a  way  to  gain 
insight  into  problems  with  multiple  unknowns.  The  method  is  fairly  straightforward  to  implement, 
and  when  applicable,  is  extremely  computationally  efficient.  Indeed,  in  some  cases  the  cost  of  the 
inverse  recovery  is  roughly  the  same  order  as  that  for  a  single  direct  solve  of  the  nonlinear  problem, 
[9].  This  is  quite  rare  in  undetermined  coefficient  problems. 

Our  intention  is  to  continue  on  with  this  program,  looking  for  various  ways  to  formulate  these 
inverse  coefficient  problems.  We  have  begun  to  look  at  several  methods  that  offer  promise,  these 
include  the  Newton,  Quasi-Newton  and  homotopy  methods.  We  will  continue  our  work  on  the  FPP 
and  collocation  schemes. 

We  have  been  looking  at  undetermined  coefficient  problems  for  first  order  hyperbolic  equations 

ut  +  Aux  -I-  Bu  =  C  (6) 

subject  to  initial  and  boundary  conditions.  These  equations  are  ubiquitous  in  applied  mathemat¬ 
ics  and  occur  in  such  diverse  areas  as  hydrodynamics,  traffic  flow,  neutron  transport  theory  and 
population  dynamics.  While  the  direct  problem  corresponding  to  these  models  are  reasonably  well 
understood,  the  same  cannot  be  said  for  inverse  problems.  The  coefficients  A,  B ,  and  C  as  well 
as  terms  that  occur  from  the  boundary  may  very  well  be  unknown,  and  additional  data  must  be 
made  available  for  their  recovery. 

In  order  to  get  a  better  feel  for  the  more  complex  problems,  we  have  looked  at  several  special 
cases,  in  particular  when  A  is  a  constant.  In  this  case  the  characteristics  are  known  and  the 
differential  equation  is  simpler  to  analyze.  We  are  unwilling  to  look  at  problems  that  are  generated 
entirely  by  mathematical  considerations,  and  thus  have  worked  on  specific  questions  that  arise 
in  a  particular  area,  in  this  case,  population  dynamics.  The  main  reason  for  this  is  that  the 
mathematical  models  almost  always  lead  to  equations  with  linear  characteristics  and  thus  avoid 
some  obvious  technical  difficulties  that  would  be  associated  with  the  direct  problem.  Although  the 
equations  we  consider  model  the  growth  of  an  age-structured  population,  where  the  birth  and  death 
rates  depend  on  age  (as  well  as  other  variables),  they  are  certainly  not  limited  to  this  application. 
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If  p(a,  t )  denotes  the  number  of  individuals  of  age  a  alive  at  a  particular  time  t ,  and  A  da  is 
the  probability  of  a  death  occurring  to  a  given  individual  in  the  period  a  to  a  +  da.  Since  this 
quantity  should  depend  on  the  age  of  the  individual  A  depends  on  a ,  and  in  the  case  of  competition, 
should  also  depend  on  p(a,t).  This  dependence  may  be  in  fact  be  on  the  total  population  at  time 
t ,  P(t)  =  J  p(a,  t)da ,  and  there  certainly  are  circumstances  where  A  =  A(a,  P)  is  the  appropriate 
model.  This  leads  to  the  following  equation, 

/><(«,  *)  +  Pa(a,t)  +  Xp(a,t)  =  0  (7) 

In  addition  to  this  we  may  prescribe  the  initial  condition 

/>(a,0)  =  <f>(a)  (8) 

and  a  boundary  condition  at  a  =  0  which  corresponds  to  the  renewal  process.  This  may  take  either 
of  the  forms 

p(0,t)  =  B(t)  (9) 

or 

/>(0,t)=  J  f3p(a,t)da  (10) 

where  (3da  is  the  probability  of  a  individual  of  age  a  giving  birth  (per  unit  time)  in  the  interval  a 
to  a  +  da.  (3  will  depend  on  a  and  probably  also  on  p(a,t).  Equations  (7),  (8)  and  (10)  constitute 
the  standard  model  for  age-structured  population  dynamics.  See  the  paper  [10]  and  its  references 
within  for  more  details.  We  will  refer  to  this  as  the  direct  problem.  It  corresponds  to  solving  a 
(possibly  nonlinear)  first  order  hyperbolic  equation  with  non-local  boundary  conditions.  However, 
our  “  species”  could  be  also  be  mechanical  parts;  p(a,  t )  would  denote  the  number  of  parts  of  age 
a  still  in  service  at  time  t ,  and  A  would  be  an  age  dependent  rate  of  failures. 

In  order  to  solve  this  direct  problem  we  have  to  know  the  values  of  A,  <j>  and  either  B  or 
(3,  and  this  is  certainly  not  reasonable  in  many  cases.  If  this  is  so,  how  can  these  quantities  be 
determined?  What  constitutes  reasonable  data  that  might  be  measured  in  order  to  identify  one  or 
more  of  these  quantities?  As  expected,  the  answer  to  this  question  depends  on  which  quantity  is 
considered  to  be  unknown,  and  on  what  variables  it  depends.  There  are  certainly  situations  where 
one  could  evaluate  p(a,T )  at  some  fixed  time  T,  this  corresponds  to  the  taking  of  a  census  or  of 
an  inventory  that  accounts  for  the  age  of  each  object.  In  other  situations  this  may  not  be  possible, 
but  one  is  able  to  measure  the  value  of  P(t),  the  total  population  or  number  of  parts  in  service. 

There  is  now  a  considerable  number  of  physically  reasonable  inverse  problems  that  might  be 
posed.  Some  of  these  include: 

(1)  Given  the  function  A  =  A(a,  P)  the  initial  data  p(a,  0)  =  4>(a) ,  and  the  values  of  P(t)  for  some  range 
0  <  t  <  L  where  L  is  the  maximum  lifespan,  determine  the  function  0(a) .  This  corresponds  to  the 
determination  of  a  nonlocal  boundary  coefficient  in  a  quasilinear  hyperbolic  equation. 

(2)  Given  the  functions  A ,  and  0  determine  the  the  initial  data  p(a,  0)  =  4>(a) .  The  overposed  data  could 
be  either  the  total  population  over  some  time  interval  [0,  T]  or  the  values  of  the  age-structure  at  a 
later  time  T ,  p(a,T). 

(3)  Given  the  functions  4> ,  and  0  determine  the  the  death  function  A .  There  are  several  cases.  The  first 
is  when  A  is  a  function  of  a  only  and  the  overposed  data  consists  of  the  values  of  the  age-structure  at 
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a  later  time  T .  A  second  is  when  A  depends  on  both  a  and  P  but  is  a  simple  sum  of  the  natural  and 
environmental  death  rates,  that  is,  A  has  the  form  A/v(a)  +  \e(P)  ,  where  Ajv  is  the  death  rate  due 
to  natural  causes  and  \e  is  the  death  rate  due  to  the  environment.  We  assume  that  Ajv(a)  is  known, 
and  the  overposed  data  consists  of  the  total  population  over  some  time  interval  [0,T]  Another  possible 
form  for  A  might  be  A (a,P)  =  A^(a)[l  +  Af;(P)].  The  first  form  assumes  that  the  environmental 
pressure  is  the  same  for  all  ages,  the  second  assumes  that  it  is  coupled  to  age  so  that  environmental 
pressure  is  greater  on  the  age  group  with  a  high  natural  death  rate. 


Problem  one  was  solved  in  [10],  where  conditions  were  imposed  for  unique  recovery.  These  re¬ 
quired  a  certain  incompatibility  between  the  initial  population  and  the  initial  birth  rate.  Without 
this  condition,  both  non-uniqueness  and  non-existence  is  possible,  and  this  was  shown  by  coun¬ 
terexamples.  In  addition  it  was  proved  that  when  a  solution  exists  it  is  unique  and  continuous 
dependence  result  could  be  obtained.  These  stated  that  the  function  (3  can  be  controlled  in  the 
supremum  norm  provided  the  overposed  data  is  controlled  in  the  C 2  norm,  proving  that  the  inverse 
problem  is  ill-conditioned.  A  counterexample  showed  this  result  to  be  essentially  the  best  possible. 
Some  numerical  examples  were  presented. 

Problem  two  was  considered  in  [11].  It  was  shown  that  uniqueness  of  the  problem  followed 
under  certain  conditions.  The  degree  of  ill-conditioning  of  the  inverse  problem  depended  critically 
on  the  behavior  of  the  function  A  in  a  neighborhood  of  the  maximum  life  span,  L  for  the  case  of 
overposed  data  P(t) ,  and  on  the  support  of  the  birth  function  0  for  the  case  of  overposed  final 
data.  Numerical  reconstructions  were  presented. 

Problem  three  is  discused  in  [12].  Uniqueness,  existence  and  continuous  dependence  results 
are  obtained  and  the  recovery  of  A e  is  obtained  by  am  iteration  procedure.  Under  certain  cases 
this  scheme  may  converge  monotonically. 

These  last  two  papers  was  work  performed  under  the  ONR  contract. 

Clearly,  the  above  represent  only  a  fraction  of  the  reasonable  questions.  The  recovery  of  both 
the  birth  and  death  functions  would  be  desirable,  as  would  the  recovery  of  one  of  these  coefficients 
as  a  function  of  two  variables,  a  and  P  (or  possibly  a  and  p).  The  case  when  the  coefficient  A  in 
(6)  is  non-constant  but  depends  on  x,  t  or  u  is  the  important  one  for  many  applications.  Much 
work  remains  to  be  done,  actually  we  have  only  scratched  the  surface.  We  propose  to  continue  this 
research  into  first  order  hyperbolic  equations,  and  we  have  two  graduate  students  who  are  starting 
to  work  on  these  problems. 

Papers  published 

During  the  past  year  we  have  submitted  five  papers  for  publication,  [7,  9,  11,  12,  13].  One  of 
these,  [13]  has  been  accepted  by  the  Journal  of  Differential  Equations,  and  a  second,  [7]  by  Nu¬ 
merical  Methods  in  P.D.E.  Another  paper,  [8],  was  revised  during  the  grant  period.  Several  papers 
incorporating  our  most  recent  results  are  in  preparation.  In  addition,  there  are  two  conference 
proceedings  resulting  from  this  work.  One  of  these  is  to  the  I.E.E.E  conference  in  Control  1 ,  and 
we  have  provided  a  copy  of  the  paper  that  will  be  published  in  the  proceedings.  The  other  was 

1  December  13-15,  Tampa  Florida. 
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presented  at  the  Areata  conference  mentioned  earlier. 


Invited  Talks 

1.  Inverse  problems  in  Partial  Differential  Equations,  AMS/SIAM/IMS  Summer  Conference  Series, 
Areata  California,  July  26,  August  4  1989.  Invited  hour  talk,  presented  by  Michael  Pilant. 
The  talk  will  appear  in  the  conference  proceedings  to  be  published  by  SIAM. 

2.  “A  collocation  scheme  for  the  identification  of  coefficients  in  nonlinear  parabolic  equations” 
I.E.E.E  conference  in  Control,  Tampa,  Florida  December  13-15,  1989.  Invited  talk,  to  be  given 
by  Michael  Pilant.  A  conference  proceedings  will  result. 

3.  “Using  Domain  Decomposition  Methods  for  Computing  Singularities  of  Equations  of  Mixed 
Type,”  SIAM  Conference  on  Domain  Decomposition  Methods,  Houston,  March  20-22,  1989. 
Presented  by  Michael  Pilant,  also  chaired  session. 

4.  “Undetermined  Coefficient  Problems  for  First  Order  Hyperbolic  Equations,”  European  Con¬ 
ference  on  Inverse  Problems,  Montpelier,  France,  November  26-December  2,  1989.  Presented 
by  William  Rundell. 

5.  AMS  Regional  Meeting  “Special  Session  on  Inverse  problems,”  Manhattan,  Kansas  March 
16-17,  1990.  Invited  talk  (William  Rundell). 

6.  SIAM  Symposium  on  “Invariant  Imbedding  and  Inverse  problems,”  Albuquerque,  New  Mexico 
April  19-22,  1990.  Invited  talk  (William  Rundell). 

Other  activities  during  the  contract  period 

William  Rundell  was  the  Chairman  for  the  AMS/SIAM/IMS  Summer  Conference  on  “Inverse  prob¬ 
lems  in  Partial  Differential  Equations"  held  at  Areata  California,  July  26,  August  4  1989.  Approx¬ 
imately  70  people  attended  the  workshop  style  conference,  there  were  15  main  speakers  covering 
as  wide  a  range  of  topics  as  possible  within  the  framework  of  the  title.  Funding  for  the  conference 
was  by  NSF  and  ARO. 

Equipment  and  personnel  available  for  continued  research. 

This  fall  we  were  fortunate  in  being  able  to  hire  two  new  assistant  professors  who  will  actively 
participate  in  the  general  area  of  inverse  problems,  and  will  add  considerable  expertise  to  our 
efforts. 

Richard  Fabiano,  a  student  of  John  Bums,  spent  3  years  at  Brown  working  with  Tom  Banks. 
His  speciality  is  in  control  theory,  in  particular  for  viscoelastic  equations,  but  he  is  interested  in 
pursuing  the  not  unrelated  area  of  undetermined  coefficient  problems.  His  knowledge  of  the  output 
least  squares  methods  for  distributed  parameter  systems  will  be  very  useful  in  this  regard. 

Bruce  Lowe  was  a  student  of  Robert  Kohn  and  graduated  from  Courant.  He  works  in  numerical 
schemes  for  undetermined  coefficient  problems  using  variational  methods. 

There  are  three  Ph.D.  students  beginning  work  in  this  area  under  our  direction. 
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We  have  excellent  computational  resources  at  our  disposal.  These  include  an  IRIS  3130,  MIPS 
120,  MIPS  M2000,  two  SUN  and  two  VAX  workstations.  The  IRIS  graphics  workstation  and  the 
M2000  were  obtained  through  NSF  equipment  grants. 

The  current  ONR  grant  is  the  primary  means  of  funding  for  this  research  effort.  We  believe 
substantial  progress  has  been,  and  will  continue  to  be  achieved  in  the  area  of  inverse  coefficient 
recovery  problems  for  partial  differential  equations. 


Referenced  publications  by  the  investigators. 

[1]  Pilant,  M.  and  Rundell,  W.  Undetermined  coefficient  problems  for  nonlinear  elliptic  and 
parabolic  equations,  in  Inverse  Problems,  Birkhauser,  later.  Ser.  Numer.  Math.,  77,  139-154, 
(1986). 

[2]  Pilant,  M.  and  Rundell,  W.  Iteration  Schemes  for  Unknown  Coefficient  Problems  Arising  in 
Parabolic  Equations,  Numerical  Methods  in  P.D.E. ,  3,  (1987),  313-325. 

[3]  Pilant,  M.  and  Rundell,  W.  An  inverse  problem  for  a  nonlinear  elliptic  equation,  S.I.A.M. 
Journal  of  Mathematical  Analysis,  18,  (1987),  1801-1809. 

[4]  Pilant,  M.  and  Rundell,  W.  Fixed  Point  Methods  for  a  Nonlinear  Parabolic  Inverse  Coefficient 
Problem,  Communications  in  P.D.E.,  13,  (4),  (1988),  469-493. 

[5]  Pilant,  M.  and  Rundell,  W.  An  Iteration  Method  for  the  Determination  of  an  Unknown  Bound¬ 
ary  Condition  in  a  Parabolic  Initial-Boundary  Value  Problem,  Proceedings  of  the  Edinburgh 
Math.  Soc.,  32,  (1989),  59-71. 

[6]  Pilant,  M.  and  Rundell,  W.  A  uniqueness  theorem  for  determining  conductivity  from  overposed 
boundary  data,  Journal  Mathematical  Analysis  and  Applications,  136,  (1),  (1988),  20-28. 

[7]  Pilant,  M.  and  Rundell,  W.  Recovery  of  an  Unknown  Specific  Heat  by  Means  of  Overposed 
Data,  Numerical  Methods  in  P.D.E. ,  to  appear. 

[8]  Pilant,  M.  and  Rundell,  W.  Multiple  undetermined  coefficient  problems  for  quasi-linear  para¬ 
bolic  equations,  Numerical  Methods  in  P.D.E.,  to  appear. 

[9]  Pilant,  M.  and  Rundell,  W.  Identification  of  Nonlinear  Flux  Terms  in  a  Parabolic  Initial- 
Boundary  Value  Problem,  submitted  to  Mathematics  of  Computation. 

[10]  Rundell,  W.  Determining  the  birth  function  in  an  age  structured  population,  Mathematical 
Population  Studies,  4,  (1989). 

[11]  Pilant,  M.  and  Rundell,  W.  Determining  the  initial  age  distribution  for  an  age  structured 
population,  submitted  to  Mathematical  Population  Studies. 

[12]  Pilant,  M.  and  Rundell,  W.  Determining  a  coefficient  in  a  first  order  hyperbolic  equation, 
submitted  to  S.I.A.M.  Journal  Applied  Math.. 

[13]  Rundell,  W.  and  H.  M.  Yin,  A  parabolic  inverse  problem  with  an  unknown  boundary  condition, 
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Abstract: 


In  this  paper  we  consider  a  class  of  inverse  problems  in  which  an  unknown  function,  c(-),  is  to  be  determined 
from  a  parabolic  initial-value  problem,  with  overposed  Dirichlet  data  along  a  portion  of  the  boundary.  A 
mapping  between  the  overposed  data  and  the  unknown  coefficient  is  obtained  in  the  form  of  a  singular  integral 
equation.  This  is  solved  by  iteration,  and  the  resulting  fixed  point  is  shown  to  be  the  solution  of  the  inverse 
problem.  Sufficient  conditions  for  convergence  of  this  method,  as  well  as  an  extension  to  the  case  of  an 
unknown  thermal  conductivity,  are  given. 


I.  Introduction 


In  this  paper  we  consider  a  class  of  inverse  problems  in  which  an  unknown  function,  c(  ) ,  is  to  be  determined 
from 


with  initial  data  given  by 
and  boundary  data  given  by 


c(u)u,  -  u„  =  y(x,  t)  in  [0, 1]  x  [0,  T\ 
«(i,  0)  =  «o(x)  in  [0,1] 

«*{0,t)  =  g0(t)  us(\,t)  =  gfx(f) 


(1.1) 

(1.2) 

(1.3) 


The  functions  y ,  uo ,  and  gi  are  given  data. 

If  c  were  known  (and  sufficiently  regular),  then  (1.1)  -  (1.3)  would  be  a  well  posed  initial  boundary-value 
problem  for  the  determination  of  u(x,t).  In  order  to  recover  the  pair  (u,c)  we  need  to  give  additional 
information  and  we  do  this  in  the  form  of  Dirichlet  data  along  part  of  the  boundary;  specifically,  one  is  given 
the  values  of  u(0,f)  for  0  <  t  <  T . 

The  recovery  of  unknown  coefficients  whose  argument  is  the  dependent  variable  of  the  equation  has  many 
important  applications  and  considerable  recent  effort  has  been  invested  in  these  problems.  In  particular, 
the  question  of  when  a  given  amount  of  overposed  data  is  sufficient  to  uniquely  determine  the  coefficient 
has  received  a  fair  amount  of  recent  attention,  [2,3,4,6-12].  There  are  fewer  constructive  existence  results, 
not  because  of  their  lack  of  importance,  but  rather  because  of  their  inherant  difficulty.  In  previous  work  [8], 
the  authors  have  shown  how  to  recover  the  function  /  in  u«  —  ucz  =  /(u)  by  giving  overposed  Dirichlet 
data  in  the  same  fashion,  and  have  shown  how  to  recover  h  in  a  nonlinear  radiation  boundary  condition 
du/d v  =  h(u)  for  the  heat  equation  by  prescribing  the  value  of  u  at  a  point  on  the  boundary  [9].  Numerical 
implementations  of  these  algorithms  appear  in  [10]. 

In  this  paper  the  method  of  attack  on  the  problem  will  be  similar  to  that  used  in  [8].  The  main  complication 
is  the  fact  that  the  unknown  coefficient  multiplies  one  of  the  principal  terms  in  the  operator.  We  must  ensure, 
for  example,  that  c  >  0  at  all  times.  We  shall  prove  that  this  is  indeed  the  case,  under  suitable  conditions 
on  the  data.  Furthermore,  we  shall  demonstrate  that  c(u)  can  be  recovered  by  an  iteration  scheme,  and  give 
sufficient  conditions  for  the  unique  solution  of  the  inverse  problem. 

Equation  (1.1)  describes  the  evolution  of  the  temperature  in  a  homogeneous  rod  of  constant  thermal  con¬ 
ductivity,  but'  in  which  the  specific  heat  depends  on  the  temperature.  The  forcing  term,  y ,  corresponds  to 
known  temperature  sources  (or  sinks).  Thus  the  direct  problem,  namely  (1.1)  -  (1.3)  for  a  given  coefficient  c 
is  inherently  a  quasilinear  evolution  equation  for  u(x,  t).  For  the  one-dimensional  case  discussed  above,  the 
temperature  flux  du/dx  is  known  at  both  ends  of  the  rod.  The  temperature  at  one  end  is  measured  and 
provides  the  additional  (overposed)  data  for  the  problem.  This  type  of  problem  is  typical  of  problems  where 
the  dependence  of  material  properties  is  not  known  in  advance,  and  must  be  deduced  by  experiment.  We 
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set  up  a  experiment  in  which  the  thermal  flux  is  controlled  at  the  boundaries  and  then  watch  the  resulting 
temperature  behavior.  Recovering  the  material  property  is  equivalent  to  showing  that  there  is  at  most  one 
specific  heat  which  can  give  rise  to  the  overposed  data,  and  that  one  can  in  fact  construct  the  unknown 
function  from  knowledge  of  the  overposed  data. 

In  fact  the  methods  of  this  paper  go  through  formally  for  the  case  of  n  space  variables  since  the  resulting 
estimates  have  multidimensional  parallels,  but  the  analysis  is  more  cumbersome.  If  Q  is  a  fixed  bounded 
domain  in  Rn  with  smooth  boundary  dQ  and  xq  is  some  fixed  point  in  dQ  then  the  corresponding  problem 
is  to  recover  the  pair  c  and  u  from 


c(u)ut  -  Au  =  7(1,1)  infix[0,T] 


u(x,0)  =  uq(x)  in  H 
du 


on  dQ. 

with  a  measurement  of  overposed  data  consisting  of  the  temperature  at  the  point  xq 


dZ  =  9 


u(x0,t)  =  0(t). 


In  section  5.  we  consider  a  similar  problem  to  the  above,  namely  the  recovery  of  the  thermal  conductivity 
k(u)  in 

u,  -  V  •  (i(a)Vu)  =  7(z,f). 


II.  Mathematical  Preliminaries 
We  shall  consider  the  problem 


c(u)ut  —  uxx  =  7  (x,t)  0  <  x  <  1,  <  >  0  (2.1) 

ue(0,f)  =  0,  ux(M)  =  Si(0  (2.2) 

u(x,0)  =  uo(z)  (2.3) 


with  overposed  Dirichlet  data 


u(O,t)  =  0(t). 


(2.4) 


We  define  the  direct  problem  to  be  (2.1)-(2.3)  for  a  known  c,  and  denote  by  u(x,t;c)  the  solution  of  this 
initial-value  problem. 

We  will  make  the  assumption  that  the  overposed  data  6{t)  is  a  strictly  montone  function  of  t  for  all  t  >  0. 
In  addition  we  have  the  following  constraint,  arising  from  the  consistency  of  the  data 


c(0(O))0'(O)  -  «o(0)  =  7(0,0) 


(2.5) 


which  determines  the  value  of  c(0(O))  uniquely.  By  rescaling  the  time  variable,  we  may  assume  that  c(0(O))  = 
1.  Denote  by  0(x,f)  the  function  u(x,f;  1),  so  that  ip  is  the  solution  of 

'Pi  -  'Pxx  =  y(x,  t) 

Tpx(0,t)  =  Q,  1px(l,t)  =  Sl(f) 

^(x,0)  =  «o(x). 
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(2.6) 

(2.7) 

(2-8) 


If  c(u)  «  1,  then  0(f)  «  V*(0,  i) .  The  deviation  of  u(r,f;c)  from  rp  measures,  in  some  sense,  the  size  of  the 
nonlinear  term  c(u) . 

Some  comments  might  be  made  at  this  point.  If  usx  is  negligible,  then  we  have  from  (2.1)  c  ss  y/ut  which 
implies  that  as  c  increases  then  ut  decreases.  We  can  recover,  to  leading  order,  the  behavior  of  c  by 
computing  y(0,t)/0'(t) .  In  fact  if  the  boundary  data  is  such  that  ti**(0,f)  =  0  then  the  function  c  can 
be  recovered  directly  from  c(0(f))  =  7(0,  t)/6'(t) .  Although  the  idea  of  a  perturbation  approach  about  the 
“flat  data”  case  is  appealing,  numerical  results  indicate  that  unknown  coefficients  with  large  variations  (even 
discontinuities)  can  be  recovered  efficiently. 

We  will  need  some  notation.  With 


Dt:=  {(z,f)  |  0  <  x  <  I,  0  <  t  <  T} 


(2.9) 


we  define  several  norms  and  seminorms 


||  U  ||eo=SUp|u(l,t)| 
Dt 


(2.10) 


|«(-,<)li  =  sup 


\t-n\ 


II  /  ||oo=  sup  |/(£)| ,  £  €  dom(/) 

I  ft  _  l/(£ )  /(*?)!  r  _  j /  t\ 

|/|i  =  sup - — - -  £,r?€dom(/) 

K  -  ’ll 


II  /  11=11  /  lU  +|/|i 


(2.11) 

(2.12) 

(2.13) 

(2.14) 


The  set  of  all  functions  /  defined  on  the  set  X  with  ||/||  <  00  we  denote  by  Lipi(X).  We  will  also  use  the 
notation  /  6  Lipi  if  the  set  X  is  understood. 


The  role  of  the  time  variable  is  one 
solutions  of  parabolic  equations 


of  the  complicating  factors.  We  in  fact  avoid  the  usual  semi-norm  for 

N60  -«(»?.  r)l 


|u|a  =  sup 


is  K-il’  +  H-rQ-'’’ 


since  the  appearance  of  t  in  such  an  asymmetric  way  is  one  of  the  major  obstacles  faced  in  formulating  the 
mapping  between  9  and  / . 


We  will  also  have  occasion  to  use  the  notation 


«(-,<)[  =  u(y,t)  -u(0,f). 


For  a  fixed  E  <  1,  define  the  subset  Be  to  be  the  ball  of  radius  E,  in  Lip\ ,  centered  on  the  function 
identically  1 .  That  is, 

Be  =  { c |  c  6  Lipi[0, 00),  ||1  -  cj|  <  E}.  (2-15) 

We  remark  that  any  function,  c,  defined  on  Lipi[0,T]  can  be  extended  to  a  function  on  Lip\ [0, 00)  by 
continuation  as  a  constant,  without  increase'  in  norm.  This  continuation  will  be  used  later. 

If  c  6  Bt  for  some  6  <  1 ,  we  have  |1  -  e|  <  6  and  hence  0<1  —  $<|e|<l-M<2  and  c  is  uniformly 
bounded  away  from  c  =  0 ,  which  is  necessary  for  the  iteration  scheme  to  remain  well  defined. 

We  shall  denote  by  C  a  generic  constant  which  depends  on  the  domain,  but  is  independent  of  the  solution 
u(z,t;c),  and  we  denote  by  C(E)  a  generic  constant  which  depends  on  u  only  through  E . 
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Finally,  we  say  the  c  €  Be  has  property  S  if 

||c(ti)  -  c(w)||  <Ce||u-t>|| 

for  some  constant  Cg  <  oo .  This  property  holds,  for  example,  if  c  €  CJ . 

We  will  also  require  the  following  assumptions  on  the  data  and  on  c 

Al.  (i)  u0  G  C3[0, 1] ,  (ii)  gi(t)  €  CJ[0,oo),  (iii)  y(x,t)  €  C^O,  1]  x  C^O.oo). 

A2.  0(f)  is  a  monotone  function,  whose  derivative  lies  in  C^O.oo)  and 

inf  |0'(f)|  >  6  >  0. 
i>o  ~ 

A3,  c  €  Be  for  some  E  <  1 . 

A4.  The  function  9  is  such  that  there  exists  a  c  €  Be  with  u(0,t;c)  =  0(f). 

Some  remarks  on  these  assumptions  are  in  order. 

Remark  1:  Assumption  Al  is  sufficient  to  ensure  that  u  is  sufficiently  regular  and  generates  smooth 
boundary  data.  A2  is  necessary  to  recover  c  from  knowledge  of  c(9(t )) ,  and  to  ensure  compatibility  with 
some  c  6  Lip\  function  generating  the  data.  A3  is  sufficient  to  ensure  that  the  problem  remains  strictly 
parabolic.  Finally,  A4  is  necessary  in  order  to  ensure  that  a  solution  to  the  problem  exists. 

Remark  2:  In  the  case  where  u  is  spatially  independent,  the  direct  problem  becomes 

u,  =  [c(u)]"17(f) 

and  this  would  not  necessarily  possess  a  unique  solution  unless  c(  )  were,  for  example,  Lipschitz.  Uniquess  of 
the  direct  problem  is  necessary  in  order  that  our  algorithm  be  well  defined.  To  show  uniqueness  of  the  inverse 
problem  we  actually  need  to  impose  further  constraints  on  the  solution  c,  namely  that  it  have  property  5. 


III.  The  Boundary  Coefficient  Mapping 

Evaluating  (2.1)  on  the  overposed  boundary,  and  using  the  overposed  data  (2.4),  we  observe  that  any  strong 
solution  of  (2.1)-(2.3)  and  (2.4)  must  satisfy 

-  u„(0,f;c)  =  7(0,  f).  (3.1) 

Define  the  mapping  T[c]  by 

TH  =  (3.2) 

where  u(x,f;c)  is  the  solution  of  (2. 1)— (2.3)  for  a  prescribed  c(-).  Any  solution  of  the  inverse  problem 
therefore  satisfies 

T[c](«)  =  c(9(t)).  (3.3) 

We  call  such  a  function  a  0 -fixed  point  of  T .  The  natural  iteration  scheme  introduced  is  clearly 

c<"+1>(0(<))  =  T[c(">](<).  (3.4) 

If  u<n>  s  u(x,t;cf-n >)  >  9{T)  then  we  define  c^n^(u^"))  =  c^"^(0(T))  and  similarily  if  the  argument  is  less 
than  0(0) .  This  extends  in  a  Lipschitz  continuous  manner  without  increasing  its  norm.  This  guarantees 
that  the  scheme  is  well  defined  even  if  it  should  happen  that  one  of  the  iterates  should  lie  outside  of 
the  range  of  values  of  the  overposed  data. 
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Note  that  8{t)  must  be  monotone  for  one  to  recover  c^n+1)  from  (3.4).  As  we  remarked  previously,  in  order 
for  this  scheme  to  be  well  defined,  we  must  have  uniqueness  for  the  forwards  or  direct  problem.  We  will  show 
that  T[cj  maps  a  certain  ball  in  Be  into  itself.  If  the  data  is  sufficiently  smooth  (satisfying  Assumptions 
Al,  A2,  A3),  and  the  initial  starting  value  lies  in  Be  ,  then  it  is  guaranteed  that  each  iterate  will 
yield  a  unique  u(x,<;c("))  to  the  direct  problem. 

The  outline  of  the  rest  of  the  paper  is  as  follows.  Lemma  1  shows  that  the  inverse  problem  can  be  reduced 
to  finding  a  fixed  point  of  the  mapping  T[  ].  Theorem  1  shows  that  T[  ]  maps  Be  to  itself  which  implies 
that  the  iteration  scheme  is  well  defined.  In  Theorem  2,  using  an  additional  regularity  assumption,  we  show 
that  there  is  a  unique  fixed  point. 

The  iteration  scheme  (3.4)  can  be  recast  in  the  form 


c("+1)(0(t))  = 


u„(0,  t ;  c(">)  +  7(0,0 


m 

1 

u,(n)(0,t;c) 

c(n>(tin(0,f)) 

9>(t) 

c(n\9(t))  + 

0'(t) 

(3.5) 


+  [c<"V"Ho,0)-c(n)(*(0)] 

=  c*B>(0(*))  +  ^„(u(n)(0,  t)  -  0(t );  c<">) 


where  it  is  seen  to  be  equivalent  to  a  nonlinear,  nonstationary  residual  update  scheme.  In  order  to  show  the 
equivalence  of  5 -fixed  points  and  solutions  to  the  inverse  problem  we  prove  the  following  Lemma. 

Lemma  1.  If  (ti,  c)  is  a  solution  of  (2.1)-(2.3),  then  u  satisfies  the  overposed  boundary  conditions  (2.4)  if 
and  only  if  c  is  a  9 -fixed  point  of  T[  ] ,  defined  in  (3.2). 


Proof:  If  (u,c)  is  a  solution  of  the  equations  (2. 1)— (2.4)  c  is  a  0 -fixed  point.  This  is  clear,  since  the 
identity  c(9(t))0'(t)  -  uxx(0,t)  =  7(0,  t),  which  follows  from  (3.1),  implies  that 


cm)  = 


(0,0 +  7(0,0 

m 


—  T[c](0- 


On  the  other  hand,  suppose  c  is  a  0-fixed  point,  then 


c(0(t))0'(t)-uxx(O,t)  =  7(O,t) 


but  from  (2.1)  the  solution  u  satisfies 

c(u(x,  t))ut(x,  t)  -  uxx(x ,  t)  =  y(x,  t). 

Evaluting  this  last  expression  at  x  =  0,  and  subtracting  from  the  previous  expression,  we  obtain 


Therefore, 


c(0(O)0'(O-c(w(o,OMO,O  =  o. 

[c(0(O)  -  c(tt(0, 0)]  0'(t)  +  c(u(0, 1))  [0(f)  -  .1,(0,  o]  =  o. 
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Solving  for  0'(t)  —  ut(0,<)  we  have 


^(0  -  ut(0,<)  =  [c(u(0, 0)  -  c(^t))]c(7(o^))- 

By  Assumption  A3,  |c(u(0,<))l  >  0  and  by  A2  |0'(i)|  <  oo.  Consequently,  if  c  is  uniformly  Lipschitz; 

!*'(*)  -  «t«M)l  <  ||c||^2i_|g(0  -  «(0, 01- 


Setting  a(t)  =  9(t)  —  u(0,0 


|a'(0l  <  A/K0I- 


(3-6) 


Since  a(0)  =  0  (the  initial  boundary  data  and  overposed  data  are  consistent),  GronwalPs  inequality  yields 
|a(0l  =  0,  and  the  Lemma  is  proved. 

The  equivalence  of  0-fixed  points  and  solutions  of  the  inverse  problem  appears  to  be  a  general  result  for  this 
method  of  solving  single  inverse  coefficient  problems. 


(4.1) 


(4.2) 


IV.  Iteration  Procedure 

The  Green’s  function  for  the  heat  equation  on  the  unit  interval,  with  homogeneous  Neumann  boundary 
conditions,  has  the  form 

K(x,y,t)  =  G(x,y,t)  +  H(x,y,t) 

where 

G(x,y,t)  =  ^  {«-<*-»>*'«  +  e-(*+»>3/<‘} 

and  H  is  bounded,  C°°  -smooth  kernel  for  t  >  0,  (1). 

Rewriting  (1.2)  in  the  form 

ut  -  «**  =  tOM)  +  (l  -  c(u))u, 

and  subtraction  from  (2.6)-(2.8)  implies  that  the  function  v  —  u  —  tf>  must  satisfy 

Vt  -  t>r«  =  [1  -  c(w)]  u, 

ur(0,t)  =  0,  MU)  =  0 
t>(z,0)  =  0 

Consequently, 

u  -tpsv=  f  I  K(x,y,i  —  r)[l  —  c(u(y,  r))]u»(y,  r)dydr 
Jo  Jo 

Differentiating,  with  respect  to  x  twice,  and  setting  x  =  0 ,  we  obtain 

u„(0,<;c)  =  ^„(0,f)+  /  /  K„(0,  y,  f  -  r)  [1  —  c(u)]  u,  dydr 
Jo  Jo 

and  consequently 

T[c](t)  =  0_1(<)  [7(0,  t)  +  «»r(0,<)] 

=  ^OU)  +  \M<U)  +  ^  K„(0,y,t  -  r)[l  -  c(u)]utdydr 

=  0~1(t)  +  1  ~  T)U  “  c(u))u‘  • 

The  next  Lemma  is  crucial  to  the  estimates  that  will  follow. 


(4.3) 


(4.4) 


(4-5) 
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Lemma  2.  The  kernels  Kx  and  Kxx  have  the  property  that  for  any  fixed  xq  , 


ill  Kx(x,  y,  t  -  r)dyj  h(x0,  r)dr  =  0  t  >  0 
m  Kxx(x,  y,  t  -  r)dyj  h(x0,  r)dr  =  0  t  >  0 


for  x  €  [0, 1]  and  h  €  Lipi . 

Proof:  Fix  xq,  and  set  /(<)  =  h(x0,t) .  The  function 


u>(x,t)=  f  f  K(x,y,t  —  r)f(r )dydr 
Jo  Jo 


satisfies  the  following  boundary  value  problem: 

wt  -  wxx  =  /(f) 

u,,(0,t)  =  0  u/*(l,f)  =  0 

w(x,  0)  =  0. 

w  is  therefore  a  function  of  f  alone,  hence  wx  =  0  and  wxx  =  0 . 

This  lemma  implies  the  useful  identities 

nKx(x,y,t  ~  r)[h{y,r)  -  h{x0,r)\dydr  -  /  /  Kx(x,y,t  -  T)h(y,T)dydr 

Jo  Jo 

and  «  l  ‘i 

/  /  Kxx(x,y,t  -  r)[h(y,r)  -  h(x0,r)]dydT  =  f  f  K„(x0,y,t  -  r)h(y,r)dydT 

Jo  Jo  Jo  Jo 

which  will  be  used  to  simplify  the  nonlinear  estimates  which  follow. 

The  first  step  will  be  to  show  that  T  maps  a  ball  in  Lip\  to  itself  on  the  space  Be  ,  that  is,  we  will  show 
that  if  E  <  1  is  sufficiently  small 

111  -  T[c]|I  <  ||1  -  c||  (4.8) 

for  c  €  Be  •  Note  that  from  (4.5), 

T(c]-l  =  [^(0,f)-^(f)]/[fi/(f)] 

+  /  [  KXX(Q,  y,  f  -  r)[l  -  c(u)]u,  dydr  /  [fi'(f)] 

Jo  Jo 

l|T[c]  -  ill  <  111/^(01111  [*(0, 0-^(0111 

+  ||l/^/(t)||  1/  tfr,(0,y,t-r)(l  -  c(u))u,  dydr 

We  must  carefully  estimate  the  Lip\  norm  of  the  operator  K ,  defined  by 


A(*,f)  =  K[h]  =  /  /  Kxx(x,y,t-T)h(y,T)dydT 

Jo  Jo 


(4.10) 
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on  the  boundary  x  =  0. 


Suppose  that  h(x,t)  =  x,  we  have  (with  some  easy  computations) 
r*  /-i  r*  fi 


f  /  Kxz{x,  y,  t  -  r)h(y,  r)dydr  =  [  f  /fr*(x,  y,  t  -  r)ydydr 
Jo  Jo  Jo  Jo 


We  see  that  Lipschitz  functions  in  x  are  mapped  to  functions  in  C1^2  with  respect  to  t .  This  implies  that 
a  function  which  is  Lipi  with  respect  to  x  and  t  is  mapped  to  one  which  is  Lip\  in  x  and  only  C1^2  in  t , 
apriori. 


It  is  precisely  this  loss  of  regularity,  in  t ,  which  makes  the  analysis  so  much  more  delicate.  The  iteration 
scheme  is  not  a  priori  well  defined  unless  there  are  additional  conditions  present  which  restore  the  range  of 
K  to  be  a  subset  of  Lipi  in  the  time  variable.  The  interaction  between  the  nonlinearity  and  the  overposed 
boundary  has  the  effect  of  restoring  the  smoothness.  A  similar  phenomenon  may  be  found  in  [8]. 

Using  (4.7)  we  have  the  identity 

Kxx( 0,  y,  t  -  r)[l  -  c(u)]u«(y,  r)dydr 

=  [  [  ^r*(0,y,<-r){[l  — c(u(y,r))]ut(y,T)  — [1  — c(u(0,r))]u,(0,  r)}dydr 
Jo  Jo 

-  Jo  J  A’«(°-y><-r)[1-c(tt(y.r))](u«(y.i')  -««(0,r)]dydr 

+  /  /  ^(O.y.t-rJWa^.r^-cKy.rJJjut^.rJdydr.  (4.11) 

Jo  Jo 

To  estimate  the  three  differences  appearing  in  the  integrands  of  (4.11)  we  shall  require  Lemmas  3-5: 


Lemma  3.  |c(u(z,  t))  -  c(«( 0,  t))|  <  C'|c|i||t*»*||0O  jar|2. 


Proof:  The  proof  of  this  Lemma  is  a  direct  application  of  Taylor’s  Theorem,  and  the  fact  that  ur(0,<)  =  0. 

W«(*.  0)  -  c(«(°.  0)1  <  kliN*.  0  -  «(o>  01 

<  |c|ij«r(0,0*  +  «**(£.  0l2/2|  for  some  0  <  £  <  1 

<  C|c|1||u„i|0O|*|2. 

Lemma  4.  |1  -  c(u(*,t))|  <  C||l  -  c(u)||  {||«er||00|a:|2  +  ||w<lloo|<|}  • 

Proof:  Using  a  Taylor  series  expansion  of  c(u(x,<)),  we  conclude 

1 1  -  c(u(x,  t))|  =  |1  -  c(u(0,  f))  +  c(u(0,  t))  -  c(ti(x,  t))| 

<  |1  -  c(«(0,f))|  +  |c(u(0,f))  -  c(u(*,t))| 

=  |c(u(0, 0))  -  c(u(0,  <))|  +  |c(u(0,  t))  -  c(u(x,  f))| 

<  |c|i|««(0,  )<|+  |c|l|Axau««(-,0l 

<  C|c|i  {|«||oof°  +  |u*«|ooX2} 

which  proves  the  lemma  since 

Mi  <  Mi  +  111  -  e||,o  =  ||1  -  c||oo  +  |1  -  cU  =  ||1  -  c|| 

Lemma  5  is  a  fundamental  a  priori  estimate  for  the  direct  problem. 
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Lemma  5.  Given  c  €  Be  ,  we  have  for  E  sufficiently  small; 

\ut(x,t)\<C(E) 

\uxx(x,t)\<C(E) 


\ut(x,t)  -  ut(0,t)\  <  C(E)\x\. 

Proof;  From  [5,  Section  4.5],  we  have  the  fundamental  estimates  that  the  solution  of 


t it  -  u«*  =  f(x,t) 


satisfies 


IK,  *,,||  <  Ci  +  Ca||/l| 

where  C\  depends  only  on  the  initial-boundary  data.  With  /  =  (1  -  c(u))utl  we  are  led  to  the  estimate 


which  leads  to 


INI  <Ci  +  C2 1|(1  -  e(u))iit||  <  Ci  +  C2||l  -  c(u)||||u,||  <  Cx  +  C2£||tit|| 


INI  < 


-  1  -  C2E  ’ 

The  Lemma  follows  immediately  from  the  definition  of  the  norm. 

Using  these  lemmas  and  referring  to  remark  2,  we  conclude  that 

Theorem  1.  Under  the  assumptions  on  the  data  given  by  A1 — A4,  and  if 

£STF(5j?  tl  +  l<'<0)11 

and 

(  V(0)l2 

satisfy  e  <  (1  -  6)E ,  then  for  t  <T  and  E  sufficiently  small,  T[  ]  maps  Be  into  itself.  C(E)  is  a  constant 
which  depends  continuously  on  E . 

Proof:  Estimate  each  of  the  integrands  in  (4.11)  using  Lemmas  4,  5,  and  3,  respectively  to  obtain: 
i/7  K„(0,  y,  t  -  r)[l  -  c(u)]u«(y,  r)dydr\ 

-lie  \Kxt(0’y,t  ~  r>l{lcl1  ‘  C  [IIWloo!/2  +  ll«*ll~r]  C(£?)|y|  (4-12) 


•  C(^)|c|i||ux,||oo|y|J|dydr. 


In  order  to  simplify  this  expression  we  use  the  estimates: 


/  f  \Kxx(0,y,t  -  T)\\y\adydT  <  c(a)t“/2 
Jo  Jo 

f  / 1  !/f.r(0,y,f  -  r)|r*|y|0dydr  <  c(o)^+“/J 
Jo  Jo 


9 


Taking  a  =  1  and  0  =  1  in  the  above  estimates,  (4.12)  reduces  to 


Ijfjf  K*x(0,  y,  <  -  r)[l  -  c(ti)]tif(y,  r)dydr| 

<  |c|i  •  C(E)  [||u„||00]  [t3/2  +  t\+  |c|1C(£)||ut||0ot3/2 

<  |c|1C'(JE7)||«„||0Oi  +  IcIiCC^XIIu^Hoo  +  ||«t||oo)<3/2 

<||i-c||[c(£:)|K,||0Ot+o(t3/2)]. 

Hence,  from  (4.9),  and  the  above  inequality,  we  have 


111  -  TWIU  < 


WMW-mW* 


inft>0  |0'| 


In  order  to  compute  the  Lipschitz  norms  of  c(8(t)) ,  we  use  the  fact  that 

u,  _  SUD  Wi))-e(g(ta))||f1-ta| 

<  1  8UD 
inf«>o  \0'\  *.*,  I*i-*al 

Using  a  similar  breakup  of  the  integrals  as  in  (4.11),  one  can  show  that 


1 1  tml<  ll[^~g>]11 
1  1Ul1-  (inft>o|0'|)2 


+  r 


l|l-c|| 


inf, 


t>o 


^[Crau^ll^+O^/2)]. 


Combining  (4.13),  (4.14)  we  obtain 


+  {[c(£),infM  +  C(£)]iJ^fejJ}||l-cr 


=  £  +  6||1  -  cjl 


(4.13) 


(4.14) 


(4.15) 


For  t  <  T  sufficiently  small,  we  have  £  as  e  and  6  sb  6,  and  by  the  assumptions  on  e  and  6,  ||1  -  T[c]||  <  E 
and  the  Theorem  is  proved. 

This  is  sufficient  for  showing  that  T  :  Be  — *  Be  •  We  therefore  have  a  bounded  sequence  of  iterates  in 
Be  C  Lipi  with  a  subsequence  converging  in  C&  ,  strongly,  for  0  <  1. 

We  must  show  that  the  limit  is  unique  in  order  to  obtain  a  solution  to  the  inverse  problem.  We  do  this 
by  showing  that  Be  contains  an  attractor  for  T,  that  is,  there  exists  an  element  5  €  Be  such  that  c  is  a 
9 -fixed  point  and 

||T[c]-T[c)||<%-c||  S<  1 


10 


for  all  c  €  Be  ■  We  note  that 


T[c]  -  T[c]  =  [fl'COF1  /  /  K*x(0,  y,t  —  r)  [c(u)  -  c(u)]  u ,dydr 
Jo  Jo 

+  [0'(t)]~1  [  f  K„(0,y,t  -  r)[l  -  c(u)](ti,  -  ut)dydr 
J o  Jo 


=  P (Of*  |  j*  jj  (K.)  -  c( «)] r)  j 

+  L  Jo  Ktx  T^) 


dydr 


dydr 


■  Jq  Jq  ^*x^[l-c(u)][u,(  ,r)--tit(-,r)]^  dydr  j. 


The  only  difficult  term  to  estimate  is  the  second  integral  in  the  last  equality.  If  c  is  smooth  enough  so  that 
||c(u)  —  c(u)||  <  Cb||u  —  u|| ,  that  is  if  Property  S  of  section  2  holds,  then  we  may  conclude  that 


TO  -  T0II  <  [C(£)  Jbsbp  +  0(1)]  ||c  -  dl 


(4.16) 


where  C(E)  is  a  bounded  constant,  which  depends  continuously  on  E . 

Consequently,  if  t  <  T  is  sufficiently  small,  and  |uq|oo/0'(O)  is  sufficiently  small,  then  T[  ]  is  a  contraction 
on  Be  ,  with  respect  to  c . 

If  we  assume 

A4* .  The  function  8(t)  is  such  that  there  exists  a  c  6  Be  with  Property  S  such  that  u(0,t;c)  =  9(t) . 
We  then  have, 


Theorem  2.  If  Al— A3  and  A4*  hold  then  there  is  a  unique  solution  to  the  inverse  problem.  Furthermore, 
this  solution  c  can  be  obtained  as  the  limit  of  the  iteration  scheme  (3.4). 

We  remark  at  this  point  that  although  this  proof  of  a  unique  fixed  point  does  not  constitute  an  existence 
proof  (since  we  have  assumed  apriori  the  existence  of  a  c  giving  rise  to  the  overposed  data),  it  is  never¬ 
theless  constructive  and  yields  geometric  convergence  to  the  solution.  An  existence  proof  would  require  a 
characterization  of  the  class  of  overpoeed  data  which  is  the  image  (under  the  heat  operator)  of  Be  ■  This  is 
known  to  be  a  difficult  problem.  A  partial  explanation  is  that  C3  initial-data,  with  homogenous  boundary 
data  and  forcing  functions,  leads  to  data  u(0,t)  =  0(<)  which  is  analytic  in  t ,  and  can  be  represented  as  a 
Dirichlet  series  for  t  >  0 . 


V.  Recovering  an  unknown  thermal  conductivity. 

A  related  problem  to  the  one  considered  in  this  paper  is  the  recovery  of  the  conductivity  k(u)  from  the 
nonlinear  parabolic  equation 

-  V  ■  (k(u)Vu)  =  y(x,  t).  (5.1) 

The  direct  problem  consists  of  (5.1)  along  with  the  same  boundary  conditions  as  for  the  determination  of 
c(u),  that  is  initial  data  together  with  the  specification  of  the  flux  on  the  lateral  boundaries. 

*(«)l^  =  ?(0  (5-2) 
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u(x,0)  =  u0(*) 


(5.3) 


The  overposed  data  needed  to  recover  k(u)  will  again  be  given  by 

u(x0,t)  =  ^(t)  xo  €  <90  (5.4) 

In  one  space  variable  several  results  are  known  for  this  problem.  Uniqueness  of  a  solution  pair  (u,  k)  has  been 
shown  by  Cannon  and  Yin,  [3],  and  Cannon  and  DuChateau,  [2],  have  provided  conditions  that  guarantee  a 
minimum  to  the  quantity  ||u(0,  f ;  k)  —  0(t)||  in  an  appropriate  setting. 

The  problem  (5. 1)— (5.4)  has  many  similarities  to  (1.1)— (1.4).  However  the  strategy  of  evaluating  (5.1)  on 
the  overposed  section  of  the  boundary,  x  =  xo  leads  to  a  difficult  scheme  to  analyze  since  k(u)  is  implicitly 
defined  in  the  equation.  A  common  strategy  (see  for  example,  [2]),  in  dealing  with  this  situation  problem  is 
to  introduce  the  change  of  variables  v  =  k{r)dr .  In  the  special  case  7  =  0  and  uq  =  0  this  leads  to  the 
boundary  value  problem 

c(  v)vt  —  Av  =  0 

^  =  5(0  (5.5) 

v(x,  0)  =  0 

with  the  overposed  data 

u(xo,0=  /  k(r)dr  =  i>k(0) 

Jo 

where  e(v)  =  [fc(^j'(t;)]-1 .  Implementing  the  iteration  scheme  (3.4)  by  evaluating  (5.5a)  on  the  boundary 
point  x0  gives 

c„+i(tM*))  =  =  Av({x0’t:^  (5.6) 

M*0»  Cn) 

and  the  problem  is  reduced  to  a  problem  very  similar  to  (1.1)-(1.4). 

In  the  case  of  one  space  dimension,  another  strategy  is  possible.  Let  u(x,t)  satisfy 


with 


and  let 


u,  -  (*( u)ur)x  =  7 (*,<) 

(5.7) 

— t(u(0,f))ur(0,t)  =  g0(t) 

(5.8) 

*(«(!>  0)«*(M)  =  ffi(<) 

(5.9) 

u(x,0)  =  u0(x) 

(5.10) 

u(0,t)  =  9(t) 

(5.11) 

be  given  as  overposed  data.  If  we  define  u(x,t;k)  to  be  the  solution  of  (5.7,  5.8,  5.10,  5.11),  then  for  k  €  C7 
there  is  a  unique  strong  solution  to  the  direct  problem.  In  this  formulation,  we  are  considering  the  condition 
(5.9)  as  the  “overposed  data”  and  incorporating  the  measured  Dirichlet  data  (5.11)  into  the  direct  problem. 
On  the  boundary  x  =  0,  we  have  from  (5.9)  that 


k(0(t))ux(0,t\k)  =  gQ(t) 


(5.12) 
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and  this  leads  to  the  iteration  scheme  generated  by 


1 fc(0(t»  =  T<2)[lt]  =  - 


go(0 

«r(0  ,t-,k)' 


(5.13) 


Of  course,  in  order  for  (5.13)  to  make  sense,  we  must  ensure  that  ur(0,t;ifc)  is  uniformly  bounded  away 
from  zero,  but  this  can  be  achieved  by  giving  appropriate  conditions  on  the  data.  Numerically,  (5.13) 
has  superior  convergence  properties  with  respect  to  (5.6),  partly  due  to  the  compactness  of  T^2).  Many 
numerical  experiments  indicate  that  effective  convergence  of  the  scheme  in+i  =  7^[fcn]  is  obtained  in  a 
few  iterations.  This  last  scheme  has  many  similarities  to  that  used  in  [9]  to  recover  the  form  c  he  nonlinear 
boundary  condition  ux  =  h(u)  from  overposed  Dirichlet  data,  in  that  the  boundary  operator  rather  than 
the  second  order  partial  differential  equation  itself,  is  used  to  provide  the  update  scheme.  The  fact  that 
>ower  order  differential  operators  are  involved  in  the  update  scheme  appears  to  be  responsible  for  its  superior 
convergence  properties. 

In  a  higher  number  of  spatial  dimensions,  the  interchanging  of  the  overposed  and  primary  data  is  not 
possible  as  implemented  above,  but  the  following  overposed  boundary  value  problem  case  can  be  used  to 
obtain  recovery  of  k(u) 

ut  —  V  •  (i(u)Vu)  -  -  y 

u(x,t)  -  h(x,t)  x  €  dS 1  (5-14) 


u(x,0)  =  u0(x) 


as  the  direct  problem,  and  oveposed  data  given  as  the  value  of  the  heat  flux  at  a  given  boundary  point 


x0  €  dQ. 


X-Xo 


(5.15) 


Equation  (5.15)  leads  to  the  update  scheme 


mm  = 


(5.16) 


where  0(t)  =  h(x o,<)  is  known.  The  nonvanishing  of  the  normal  derivative  du/dv  at  the  point  x  —  xo  can 
be  guaranteed  by  imposing  the  condition  that  h(x,t)  <  h(xo,t)  (at  least  in  the  case  of  y  =  0  and  u0  =  0 
and  with  suitable  modifications  otherwise).  This  condition  is  necessary  to  guarantee  that  the  entire  range 
of  values  of  u(x,t)  is  contained  on  the  line  x  =  x0  where  the  overposed  boundary  condition  is  imposed. 
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1.  Introduction. 

Consider  the  following  initial  -  boundary  value  problem 


xx  =g(x,t) 

0  <  x  <  1,  0  <t  <T 

(i.i) 

u(x,0)  =  u0(x) 

0  <  x  <  1 

(1.2) 

ux(0,t)  =  /(u(0,t)) 

(1.3a) 

-“*(1,0  =  /(«(M» 

(1.36) 

Given  sufficient  smoothness  on  and  knowledge  of  the  function  /(«),  the  above  direct  prob¬ 
lem  has  a  unique  solution  for  u(x,  t) .  However,  we  envision  the  situation  where  the  function 
/  is  unknown,  save  that  it  depends  only  on  u,  and  wish  to  determine  both  u(x,t)  and 
f(u)  by  making  additional  boundary  measurements.  The  solution  of  the  direct  problem 
for  a  given  function  /  will  be  denoted  by  u(x,t;/).  For  further  discussion  of  the  direct 
problem  and  the  modeling  of  such  boundary  conditions,  see  the  monograph  [1]. 

One  of  our  assumptions  will  be  that  f(u)  has  fixed  sign  over  its  range,  and  without 
any  loss  of  generality,  we  assume  that  it  is  nonnegative.  Given  this,  additional  assumptions 
are  made  on  the  data  so  that  heat  is  being  lost  through  the  boundaries  at  x  =  0  and  x  =  1 , 
and  the  temperature  u(x,  t)  is  decreasing  in  time  for  each  position  x . 

Finally,  overposed  data 

u(0,t)  =  h(t)  0  <  <  <  T.  (1.4) 

is  prescribed,  and  for  compatibility  this  must  be  monotone  in  t.  It  will,  in  fact,  be 
monotonically  decreasing. 

Equations  (1.1)  -  (1.3)  describe,  for  example,  the  diffusion  of  heat  in  a  uniaxial  bar  with 
nonlinear  radiation  boundary  conditions  at  the  ends,  our  problem  is  therefore  to  determine 
the  unknown  temperature  -  dependent  radiation  function  /(u)  from  a  knowledge  of  the 
initial  temperature  distribution,  and  a  measurement  of  the  temperature  u(x,  t)  at  one 
boundary. 

If  the  heat  loss  occurs  through  boundary  conduction,  we  may  assume  that  the  flux 
is  a  function  of  the  temperature  difference.  One  commonly  used  model  specifies  a  Unear 
relationship,  (Newton’s  law  of  cooling) 

du 

_  =  -*(«-««). 
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(Here,  du/du  denotes  the  derivative  normal  to  the  boundary.)  On  the  other  hand,  black 
body  radiation  into  a  vacuum  (with  zero  ambient  temperature)  is  governed  by  Stefan’s  law 


du 

dv 


=  <TU 
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where  a  is  Boltzmann’s  constant,  If  the  body  is  partially  absorbing,  then  we  have 

du  ± 


dv 


=  ecu 


where  e  is  the  emissivity  of  the  surface.  If  the  emissivity  is  unknown,  but  temperature 
dependent,  we  have  boundary  conditions  of  the  form  (1.3).  If  heat  is  lost  through  the 
boundary  through  both  radiation  and  conduction,  then  we  must  assume  a  more  general 
form  of  the  law  -  as  in  (1.3). 

This  leads  to  an  inverse  problem  for  the  unknown  dependence  of  the  flux  on  temper¬ 
ature.  We  can  also  view  this  as  a  problem  in  nonlinear  boundary  control.  If  the  boundary 
satisfies  a  known  radiation  law 

fin 

!  =  *<">'  P-») 

but  the  incoming  flux  Q  is  temperature  dependent  and  unknown,  we  have 

=  h(u)  +  Q(u)  =  /(u).  (1.6) 

We  now  wish  to  control  the  temperature  of  the  rod  to  the  extent  permitted  by  controlling 
the  temperature  at  one  end.  We  seek  to  do  this  by  controlling  the  total  flux  there.  The 
target  set  is  the  desired  temperature  response  {/*(<;)}  and  the  unknown  function  Q(-)  is 
the  control.  We  show  that  for  any  target  set  of  monotone  data,  a  piecewise  linear  function 
/  exists  with  the  property  that  the  solution  u(x,t,f)  to  (1.1)— (1.3)  with  /  =  /  satisfies 
“(0  ,<>>/)  =  Htj)- 

At  best  we  will  only  be  able  to  recover  the  function  /(u)  for  those  values  taken  on  by 
u(0,<)  for  0  <  t  <  T.  It  is  thus  necessary  that  the  range  of  values  of  the  function  u(x,t) 

for  x  €  [0, 1]  be  contained  in  the  range  of  values  of  u(0 ,t)  for  0  <  t  <  T . 

There  are  several  special  cases  of  the  inverse  recovery  problem.  First,  if  the  boundary 
conditions  on  x  =  1  are  known  (and  axe  independent  of  /),  for  example  au*(l,<)  + 
0u(l,t)  =  fifi(t),  then  the  direct  problem  may  be  solved  by  using  the  overposed  data  (1.4) 
and  the  known  boundary  condition  on  x  =  1.  The  flux  boundary  conditions  on  x  =  0 
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can  then  be  used  to  recover  /.  Similarly,  if  two  overposed  boundary  measurements  are 
used,  one  can  recover  two  possibly  different  flux  functions  f\  and  f% .  This  also  shows 
that  the  problem  of  recovering  a  single  flux  function  by  means  of  two  measurements  is 
overdetermined . 

In  addition,  we  require  the  overposed  data  to  be  monotone  in  t .  This  monotonicity 
property  is  essential  to  recover  the  function  /  from  a  knowledge  of  f(h).  No  attempt  to 
completely  characterize  the  class  of  allowable  overposed  data  h(t)  will  be  made.  Instead 
we  assume  that  for  some  function  /(u) ,  the  data  u(0,  <;  /)  is  given,  and  show  that  the 
unknown  radiation  function  can  be  reconstructed  from  the  overposed  data.  More  precisely, 
we  prove  under  certain  conditions  that  if  the  value  of  h  is  given  at  a  sequence  of  points  tj 
for  j  =  1,2, . . . ,  JV,  then  there  exists  a  piecewise  linear  function  Jn  with  slope  changes  at 
the  points  h(tj),  such  that  u(0,ty;/jv)  =  h(tj),  and  lim  ||/  — /jv||.„  =  0.  We  define  the 
residual  of  the  mapping  from  h  —*  f  as  u(0,  t;  f )  —  h(t) .  Requiring  the  residual  to  vanish 
at  a  discrete  set  of  points  tj ,  generates  a  collocation  scheme. 

In  a  recent  paper  [6],  the  authors  proved  the  existence  of  a  unique  solution  to  (1.1)  - 
(1.4).  The  solution  of  the  inverse  problem  was  obtained  by  an  iteration  scheme  using  the 
boundary  condition  itself  as  the  update  algorithm.  Although  the  update  scheme  required 
little  computational  cost  to  implement,  each  iteration  involved  solving  a  (nonlinear)  direct 
problem  of  the  form  (1.1)  -  (1.3),  and  required  the  data  to  be  fairly  noise  free.  The 
method  proposed  here  is  entirely  different,  in  that  the  algorithm  needs  only  solve  a  linear 
partial  differential  equation  at  each  stage.  Since  the  problem  is  clearly  nonlinear  in  / ,  the 
calculation  of  the  slopes  requires  an  iterative  procedure  such  as  the  secant  method.  We 
have  performed  several  numerical  experiments  which  indicate  that  the  scheme  is  robust 
under  noisy  data. 

Our  approach  in  this  paper  will  be  to  first  present  some  preliminary  technical  details 
needed  to  set  up  the  problem.  We  then  state  the  main  results  in  the  form  of  several  lemmas 
and  theorems.  A  thorough  discussion  of  the  algorithm  and  a  numerical  example  is  then 
presented,  followed  by  the  proofs  of  all  the  lemmas  and  theorems  in  a  separate  appendix. 
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2.  Preliminaries. 

In  this  section  our  assumptions  on  the  class  of  admissible  data  and  radiation  functions  / 
are  given,  and  we  introduce  some  notation  that  will  be  useful  in  the  sections  to  follow. 

In  order  to  obtain  an  integral  representation  of  the  solution  to  (1.1)  -  (1.3),  it  is  con¬ 
venient  to  consider  the  function,  w(x,t )  that  satisfies  the  initial  boundary  value  problem 

wt  —  wxz  =  g(x,t)  0  <  x  <  1,  0  <  t  <  T 

tox(0,  f )  =  0  -u>*(l,f)  =  0  0  <  f  <  T  (2.1) 

tu(x,0)  =  uo(z)  0  <  x  <  1. 

We  note  that  w(x,t)  =  u(x,i;0)  and  the  solution  of  (1.1)  -  (1.3)  can  be  written  as 

u(x,t)  =  w(x,t)  —  2  f  0(x,<  —  r)/(u(0,  r))dr 

Jo 

—  2  f  6(x  —  l,t  —  r)/(u(l,r))dr  (2.2) 

Jo 

where  9(x,  t )  is  the  theta  function  defined  for  t  >  0  by 

m=oc 

0(x,t)  =  K(x  —  2m,t)  (2-3) 

m=~  oo 

and  K{x,t )  =  e-r,/4t  is  the  fundamental  solution  of  the  free  space  heat  equation. 

See  [1]. 

The  first  lemma  gives  a  regularity  result  for  the  forward  or  direct  problem,  that  is  when 
/  is  a  known  function. 

Lemma  1.  Let  u(x,t)  satisfy  (1.1)  -  (1.3)  with  f  6  L°°  and  uo,  g(x,t )  sufficiently 
smooth.  Then  for  0  <  x  <  1,  0  <  t  <  T,  the  function  u  =  u(x,t;/)  given  by  (2.2)  is 
twice  continuously  differentiable  in  x  and  ut(x,<)  is  C°  in  t  for  0  <  a  <  1/2. 

Since  we  evaluate  solutions  of  the  direct  problem  on  the  boundaries  x  =  0  and  x  =  1 , 
the  following  properties  of  the  theta  function  will  be  needed, 

(1)  B{— x,t)  =  0(x,t)  -  oo  <  x  <  oo,  t>0 

1  00 

(2)  0(— l,f)  =  —==y^  =H\(t).  Hi(t)  is  a  C°°  function  on  [0,oo)  whose 

nth  partial  derivative  in  t ,  evaluated  at  t  =  0,  vanishes. 
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1  1  22  i 

(3)  5(0, t)  =  —f=z=z  -| — -==  V'  _ +  Ho(t),  where  Ho(t)  is  C°°  on  [0,  oo) 

V4 nt  \jTtt  "  v4 nt 

with  ^"^(0)  =  0  for  all  n. 


We  introduce  the  notation 


S<f>(t)  =  2  /  5(— l,f  —  r)<£(r)dr 
Jo 

A<f>(t)  =  —  2  /  5(0,  t  —  r)^(T)dr 
Jo 


The  kernel  of  5  is  a  C°°  function  and  5  is  an  infinitely  smoothing  operator.  The  kernel 
of  the  operator  A  has  a  singularity  of  order  1/2  and  takes  functions  in  Ca  into  functions 
in  Ca+1/2.  If  ip(t)  has  a  bounded  L°°  derivative  then  the  equation  A<f>{t)  —  rfi(t)  can  be 
written  in  the  equivalent  form,  (c.f.  [1]). 

m  -  t?  ( 1 f + £  ^  - + i  H^T)dT}  ^ 

where  H  is  a  C°°  function.  This  equation  is  of  second  kind  Volterra  type  and  can  be 
uniquely  solved  for  the  function  <j> .  Thus 

<t>  =  A~lxj)  =  {I  +  n)xi>  (2.7) 

where  72.  is  the  resolvent  operator  corresponding  to  the  Volterra  integral  equation  (2.6) 
with  smooth  kernel  H  and 


f*  0 '{r)dr  ) 

Jo  y/t-r  J 


We  will  need  some  properties  of  the  operators  A  and  S . 

Lemma  2.  For  any  function  with  <£(0)  =  0  and  in  Lc 


sup  <  CT^II^ 


0<t<T 


(2.10) 


The  values  of  the  overposed  data  h(t )  are  assumed  to  be  given  at  a  discrete  set  of 
points  {tjjo* ,  where  t0  =  0  and  t/v  =  T.  Let 

At  =  sup(tj+i  —tj),  0  <  j  <  N  —  1. 

i 
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The  notation 


i/il, =  l-f(”)l’ 

ll/ll  oo,5  =SUp|/(u)|, 
i »€S 

and 

will  be  used.  The  interval  will  be  ommitted  when  no  confusion  can  arise.  We  use  C 
to  denote  a  generic  constant  that  may  depend  on  the  6  function,  the  data,  and  various 
Lipschitz  constants,  but  will  be  independent  of  At  or  the  function  approximation  to  / 
that  will  be  obtained  by  collocation. 

The  following  assumptions  on  /  and  the  data  are  made: 

Al.  The  function  /(•)  is  nonnegative,  f  6  T°°[0,  oo)  with  ||/'||  ^  <  M  for  some 
fixed  constant  M . 

A2.  The  initial  data  is  such  that  «g  is  in  C1/2 ,  and  the  source  term  g(x,t )  is  C1/2 
in  both  x  and  t.  Thus  w(x,t)  is  a  classical  solution  to  (2.1). 

A3.  The  solution  tt(x,  t)  to  the  direct  problem  (1.1)  -  (1.3)  satisfies 

range0<a<t{u(x,s)}  C  rangeg<,<t{u(0,s)} 

for  all  x  €  {0, 1]  and  t  6  [0,  T] . 

In  order  to  guarantee  the  a  priori  existence  of  a  solution  to  the  inverse  problem  (1.1)-(1.4), 
we  assume  the  overposed  data  h(t)  actually  arises  from  some  admissible  function  /. 

A4.  There  exists  a  function  /(•)  satisfying  Al  such  that  the  solution  u(x,<;/)  to 
(1.1)  -  (1.3)  satisfies  u(0,  t;/)  =  h(t),  where  h  is  a  decreasing  function  of  t 
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3.  The  Collocation  Method. 

The  collocation  approach  to  the  solution  of  (1.1)  -  (1.4)  can  be  described  as  follows.  We 
consider  a  function  /,  piecewise  linear  on  each  of  the  intervals  [<j,  t,+i] ,  with  the  corre¬ 
sponding  function  u(x,<)  =  u(x,t;  f)  defined  on  0  <  i  <  1,  0  <  t  <  T  and  satisfying 

ut  -  uxx  =  g(x,t) 

«*(M)  =  f(u(0,t)),  -  «*(M)  = /(fi(l,0)  (3-1) 

ti(x,0)  =  «o(x). 

Assume  that  /( v)  is  either  known  or  has  been  approximated  by  a  function  f(v)  for  v  > 
h(tj).  On  the  interval  [h(fj+i), h(fj)]  extend  /  by  the  linear  function  f(v)  =  Cj(y  — 
h(tj))  +  f(h(tj))  ,  for  some  constant  Cj  which  should  be  chosen  in  order  to  satisfy  (1.4)  at 
the  next  collocation  point  t  =  tj+ 1,  that  is  to  satisfy  u(0,fy+i;/)  =  h(fy+i). 

One  particular  difficulty  is  the  lack  of  knowledge  of  a  starting  value  for  /(ho).  We 


denote  the  starting  error  /(ho)  —  /(ho)  by  Eq.  We  define  ip  by 

tpt  ~  ipzx  =  g(x,t)  0<x<l,  0  <t<T  (3.2) 

0(x,O)  =  O  0  <  x  <-l  (3.3) 

0*(O  ,t)  =  e(t)  (3.4) 

0,(1, «)  =  0  (3.5) 


where  e(t)  is  chosen  to  satisfy  e(0)  =  Eo  and  so  that  ip  vanishes  at  the  collocation  points. 
Essentially,  the  function  rp  measures  the  propagation  of  an  initial  error  in  /. 

The  main  results  of  this  paper  are: 


Theorem  1.  (Stability  Theorem)  If  assumptions  A1  -  A4  hold,  and  the  initial  error 
|/( ho)  —  /(ho)|  =  Eq  then  for  sufficiently  small  T ,  we  have  the  following  estimate 


0“|TI/  -  /IL,.(°,„  *  C[\\«  -  «|L,„o  +Eo  +  II A  *]H  »,.=«] 

Theorem  2.  (Convergence  Theorem)  If  assumptions  A1  -  A4  hold,  then  there  exists  a 
unique  piecewise  linear  function  f  such  that  the  solution  u(x,t;f )  to  (1.1)  -  (1.3)  with  f 
replaced  by  f  satisfies  (1.4)  at  each  of  the  observation  points  t  =  tj,  j  =  0,  1,  . . .  ,  N . 
Furthermore,  if  the  set  of  slope  constants  remain  bounded  then 

o<t<T  ^  ~  ^  -  C  [^°  +  for  0  <  a  <  1/2 

In  the  next  section,  we  show  that  the  initial  error  can  be  made  0(At )  by  choosing  a 
constant  approximation  to  /  on  the  initial  interval.  Consequently,  we  have 
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Corollary.  ( Startup  Error)  If  the  set  of  slope  constants  {cy}j_^  remains  bounded,  and 
the  startup  error  is  O(At)  then  the  collocation  scheme  converges  globally  of  order  p  where 

p<\. 

The  proofs  of  these  theorems  axe  presented  in  the  Appendix. 

The  next  lemma  is  crucial  to  the  well  posedness  of  the  collocation  method.  We  show 
that  the  boundary  values  of  the  direct  problem  depends  monotonically  on  the  radiation 
function  / 

Lemma  3.  (Monotonicity)  Let  (ui ,  f\ )  and  (v2,  h)  satisfy  (1.1)  -  (1.3)  where  assumptions 
A1  -  A4  hold  for  each  solution  pair.  If  f\  >  f2 ,  then  v\(x,t)  <  v2 (x,t)  for  all  (x,t).  In 
addition,  if  for  some  t' ,  fi(v(l,t'))>  f2(v(l,t')),  then  v\(x,t')  <  v2(x,t'). 

Let  us  define  the  functions  <f>o(t)  and  4>\(t)  by 


<f>o(t)  =  u(0,t)  -u(0,t) 

(3.6) 

M*)  =  u(M)  -  «(!,<) 

(3.7) 

Af(t)  =  f(h(t))  -  f(h(t)) 

(3.8) 

where  u(x,t )  and  u(x,t )  are  the  solutions  of  the  direct  problems  (1.1) 
respectively. 

We  have, 

-  (1.3)  and  (3.1) 

Lemma  4.  Given  the  assumptions  A1  —  A4, 

IWL.M  <C(«||W0II»,m  +v7||a/(()||00i[m). 

(3.9) 

This  says  that  the  error  in  the  approximation  of  the  solution  of  (1.1)  -  (1.3)  by  that  of 
(3.1)  is  no  greater  on  the  “trailing  boundary”  x  =  1,  than  it  is  on  the  “leading  boundary” 
x  =  0  plus  some  contribution  from  the  radiation  at  the  boundary.  On  the  other  hand  the 
difference  in  the  functions  /  and  /  must  satisfy, 

Lemma  5. 


H^/ll  oo.l0.tj  -  C  (  II ^0 1|  ^,[0, t]  +  £  +  II*4  1foo-0]||OO|[O,l)  +*IMoo,(<Mj  )•  (3-10) 


Given  the  collocation  condition  that  <f>o(tj)  =  0  for  j  =  0,  1  . . .  N  we  have  the  estimates, 
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Lemma  6.  If  f  and  /'  6  Loo,  then  for  any  0  <  a  <  1/2  the  function  <f>o(t)  satisfies 

sup  |<£o(*)  —  V’l  <  C  (Ai)1+°  (3-11) 

0<t<T 

sup  |^(0-r/>t|<C(A<)“.  (3.12) 

o<t<r 

We  remark  that  if  u  and  u  are  smooth  enough,  then  we  obtain  estimates  of  the  form 

sup  <j>o (t)  <  C  (At)2  (3.9a) 

o  <t<T 

sup  <f>'Q(t)  <  C  (At)  (3.10a). 

o<«<r 

Lemma  3  is  the  key  to  the  method  in  that  it  guarantees  that  the  piecewise- linear 
collocation  scheme  is  well  defined.  For  suppose  ( u,f )  satisfies  (1.1)  -  (1.3),  and  that 
u(x,t)  for  t  <  t'  and  f(v)  for  v  >  are  assumed  known.  Then  there  is  at  most  one 

linear  function  /  =  c(v  —  u(l,t'))  +b  that  agrees  with  /  at  the  point  v  =  u(l,t')  and  such 
that  u(l,t")  takes  on  a  given  value  for  some  t"  >  t' .  For  if  f\  =  c\(v  —  u(l,t'))  +  6  > 
h  —  ci(v  —  u(l>*;))  +  ^  with  say  c\  >  c^,  then  from  lemma  3,  u(x,t;/i)  >  u(x,t;/2)  for 
all  t  >  t1 .  In  addition,  if  over  the  interval  [<',<"],  either  /  >  /  or  /  <  /,  then  it  could 
not  be  that  u(l,  /";/)  =  u(l,  t"\ /). 

Besides  the  simplicity  of  this  method,  the  collocation  procedure  is  quite  versatile  and 
offers  advantages  over  global  recovery  schemes.  The  fixed  point  recovery  method  for  (1.1) 
-  (1.4)  described  in  [6]  is  difficult  to  formulate  for  overposed  data  that  is  measured  at 
an  interior  point,  or  for  the  case  of  more  than  one  spatial  variable.  Given  any  situation 
where  the  overposed  data  depends  on  the  function  /  in  a  monotone  manner,  the  collocation 
method  can  be  carried  out  in  principle. 

Theorem  3.  (Uniqueness  Theorem)  If  assumptions  A1  —  A4  hold  then  the  difference 
between  any  two  solutions  f\  and  f\  is  a  function  f(v)  with  an  infinite  number  of  zeroes 
in  any  interval  [ui ,  V2 ]  C  [h(T),  /i(0)] , 

Corollary.  If  assumptions  A1  —  A4  hold,  where  f  is  anedytic,  then  there  is  at  most  one 
solution  pair  ( u,/ )  to  (1.1 )  -  (1.4). 

We  remark  that  this  condition  on  the  difference  of  any  two  solutions  f\  and  /2  being 
a  function  with  an  infinite  number  of  zeroes  is  identical  with  the  situation  obtained  by 
several  authors  for  the  reaction-diffusion  equation  ut  —  A u  =  /(u)  where  the  function 
/(u)  has  to  be  determined  from  overspecified  boundary  data,  [2,  3].  Clearly,  an  infinitely 
oscillatory  function  cannot  be  distinguished  by  a  discrete  set  of  boundary  measurements. 
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4.  Analysis  of  the  Algorithm  and  a  Numerical  Example. 

In  this  section  we  describe  a  numerical  implementation  of  the  collocation  scheme. 

(a)  Generate  “‘experimental  data.”  Select  a  function  fact  and  obtain  the  solution  u(-;fact ) 
to  (1.1)  -  (1.3).  The  value  of  this  function  on  the  line  x  =  0  forms  the  overposed  data, 
and  is  passed  to  the  inversion  algorithm  as  h(t)  at  the  discrete  points  tj . 

(b)  Assume  that  f(v )  is  either  known  or  has  been  approximated  by  a  function  f(v)  for 
v  <  h(tj).  On  the  interval  [/i(<j),  h(tj+i)]  extend  /  using  the  linearized  approximation 
f(v)  =  A j  (u  —  h(tj))  +  / ( h(tj )) ,  for  some  constant  A j  which  should  be  chosen  in  order 
to  satisfy  the  collocation  condition  u(0, <,-+i;/)  =  h(tj+\).  We  use  a  secant  method  to 
calculate  the  slope  A  j  from  the  resulting  nonlinear  equation 

(c)  Step  (a)  is  continued  for  each  interval  [tj,  tj+i] . 


There  are  some  remarks  to  be  made  on  the  above  procedure. 

(1)  How  does  one  obtain  the  starting  value  f(h( 0))?  If  the  initial  data  and  boundary  data 

are  compatible  at  t  =  0  then  we  have  f(h( 0))  =  Uq(0)  which  determines  /  initially. 
This  will  be  the  case  if  the  process  has  been  evolving  for  some  positive  time  interval.  If 
the  data  are  incompatible,  we  then  use  collocation  on  to  determine  a  constant 

approximation  to  /  on  this  interval.  This  results  in  an  error  which  is  C?(A/i). 

(2)  The  number  of  iterations  required  to  obtain  the  slope  A  on  a  given  collocation  interval 
will  obviously  depend  on  the  tolerance  required,  and  to  the  deviation  of  the  function  fact 
from  linearity  on  this  interval,  For  most  of  the  step  sizes  and  values  of  an  actual  function 
/)  fact  we  chose,  about  3  iterations  sufficed  to  obtain  A  to  within  the  same  accuracy 
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as  the  forwards  solution  u(x,t;f).  The  cost  of  the  method  is  therefore  2  or  3  times  the 
cost  of  solving  a  linear  direct  problem,  since  the  approximation  is  by  a  linear  function 
on  each  of  the  intervals  [tj,  tj+i}. 

(3)  An  alternative  method  for  this  problem  would  be  to  let  /  depend  on  M  parameters 
{(ci, . . . , Cj)} ,  and  impose  N  constraints  to  determine  the  Cj ’s  by  a  least  squares  proce¬ 
dure.  In  our  problem  this  is  not  necessary  because  M  =  N ,  and  for  a  parabolic  equation 
the  values  of  u(xo,*i)  do  not  affect  the  values  of  u(xo>*2)  if  >  <2-  Thus  the  value 
of  Cj  does  not  depend  on  the  value  of  Cj+\.  This  allows  the  coefficients  to  be  solved 
sequentially.  This  gives  the  collocation  method  a  computational  edge  with  respect  to 
other  global  recovery  schemes. 

(4)  The  collocation  method  for  this  problem  leads  to  an  efficient  scheme  for  obtaining  an 
approximating  function  /.  The  (almost)  square  root  convergence  of  this  approximation 
to  /,  as  a  function  of  the  stepsize  At  is  a  worst  case  analysis.  The  observed  rate  of 
convergence  for  a  wide  variety  of  test  examples  was  nearly  linear.  To  find  a  value  for  the 
slope  of  /  over  the  interval  [tj,tj+ 1]  that  matched  u(0, t)  to  h(t)  to  within  sufficient 
accuracy  to  be  compatible  with  the  rest  of  the  numerical  scheme  required  only  a  few 
iterations  of  a  procedure  like  the  secant  method.  At  each  stage  of  this  process  we  must 
of  course  solve  the  direct  problem  over  the  interval  [tj,tj+ i]  with  our  guessed  value  of 
/.  Since  this  is  a  linear  function,  we  axe  only  required  to  solve  the  heat  equation  with 
linear  boundary  conditions. 

(5)  In  [6],  an  iteration  scheme  was  used  to  construct  an  approximating  sequence  to  f(u). 
Although  convergence  was  quite  rapid,  (effective  approximation  was  obtained  within  a 
few  iterations),  each  stage  of  this  scheme  required  the  solution  of  the  heat  equation  with 
nonlinear  boundary  conditions.  This  iteration  approach  did  however  allow  us  to  obtain 
an  existence,  uniqueness  and  stability  result  for  the  inverse  problem  (1.1)  -  (1.4).  The 
collocation  method  is  not  suited  to  this  task.  However,  since  it  is  based  on  monotonicity 
of  the  data  u(0,t;  /)  on  the  function  /,  we  can  use  this  to  obtain  a  uniqueness  theorem, 
although  this  is  weaker  than  that  obtained  by  the  methods  of  [6]. 

We  choose  fact  =  3u2  —  2u3 .  The  function  u(x,t;fact)  wa s  calculated  numerically  and  the 
values  of  h(t )  =  u(l,tj]  fact)  at  the  points  tj  =  j/N  for  j  =  0,  1,  ...  ,  N ,  used  as  data  for 
the  collocation  scheme.  We  used  a  step  size  of  k  =  0.005  in  time  and  h  =  0.04  in  the  spatial 
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direction.  The  table  below  shows  the  difference  of  /  and  /  in  the  supremum  and  L 2  norms 
for  various  values  of  N . 


Convergence  rate  of  /  to  fact  • 

N 

ll/act  —  /||  oo 

||/act  ~  /||  £,* 

1 

10 

25 

50 

0.0010 

Notice  that  the  convergence  of  /  to  fact  appears  to  be  nearly  linear  in  A t  =  1/jV,  rather 
than  the  square  root  dependence  obtained  in  the  proof  of  Theorem  1.  This  is  due  to  the 
fact  that  the  fact  generating  the  data  is  analytic,  which  implies  that  the  exact  solution  is 
infinitely  smooth.  The  apparent  linear  convergence  is  only  asymptotic,  and  the  last  entries 
of  the  above  table  are  close  to  the  limitations  imposed  by  truncation  error. 


5.  Appendix:  Proofs  of  the  lemmas. 

We  collect  here  the  proofs  of  the  lemmas  used  in  the  proof  of  theorem  1. 
Proof  of  Lemma  1:  Let  /(0)=0  with  f  €  Loo  and 


/(*  ~  T ) 


dr 


Differentiating  once,  in  t,  we  get 


Since 


U(t)  =  u' 


f'(*  ~  r) 


dr 


U(t  +  h)~  U(t)  = 


f\r) 


y/t  +  h  —  T 


dr 


Therefore 


\U(t  +  h)~  U(t)\  <  ll/'lloo 


<cil/'ILfc,/2(i+iog(/.)) 
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We  lose  half  a  derivative  due  to  the  singularity  of  order  1/2  in  the  kernel.  This  carries  over 
to  the  solution  of  (1.1)-(1.3)  via  the  representation  (2.2)  □. 

Proof  of  Lemma  2:  From  equations  (2.7)  and  (2.8)  we  have  using  0(0)  =  0 , 

Jo  V  *  -  T 

where  H  is  an  integral  operator  with  smooth  kernel.  The  first  equation,  (2.9)  now  follows. 
For  the  second  part,  we  have 

A~lSxp(t)  =  s)tp(s)dsdr +  H[(t  —  r)0(r)dr| 

since  H\(0)  =  0.  The  function  H[{t)  is  continuous  and  hence  the  kernel  of  the  first  integral 
operator  is  integrable.  Equation  (3.3)  is  now  immediate.  □ 

Proof  of  Lemma  3:  The  function  v  —  v\  —  must  satisfy 


vt  —  vxx  =  0 
u(s,0)  =  0 

u*(0,t)  -  f[((o(t))v(0,t)  =  /i(t>2(0,<))  -/2(*>2(0,f)) 
-vx(l,t)  -  -Mv2(M)) 


for  some  functions  £o(*)  and  £i(t).  Let  a(x,t )  be  any  function  in  C2([0, 1])  x  Cl([0,  T]) 
such  that 


at  -  axx 


«.(0,0  =  -/{(&(<)) ,  «*(M)  =  /!( 6(0) 

and  put  w(x,t )  =  u(x,<)e0,(l’^ .  Then  iw(x,f)  satisfies 


wt  -  wxx  +  2 azwx  =  0 


u>(x,  0)  =  0 

w«(0,f)  =  e-«(°.*)|/l(W2(0,t))  -  /2(i>2(0,t))}  >  0 

-t»«(M)  =  e^'^/iWl,*))  "  AWl,0)}  >  0 
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since  f\  >  fa .  The  maximum  principle  for  the  principal  part  of  the  operator  now  shows  that 
w  <  0 ,  and  thus  uj  <  vi .  The  sharper  conclusion  of  the  second  part  follows  from  the  fact 
that  if  the  boundary  conditions  in  the  above  boundary  value  problem  are  inhomogeneous,  w 
cannot  identically  vanish.  Q 


(5.1) 


Proof  of  Lemma  4:  If  we  substitute  the  pairs  (u ,  /)  and  (u ,  /)  into  equation  (2.2)  and 
subtract  we  obtain 

it(x,  t )  -  u(x,  t)  =  2  f  9(x,  t  -  T )  {/(u(0,  r))  -  f(u( 0,  r))  }  dr 
Jo 

-2  f  9(x  -  1,  t  -  r)  {/(u(l,  r))  -  /(u(l,  r))  }  dr. 

Jo 

Evaluating  this  equation  on  the  line  x  =  1 ,  and  expanding  terms,  gives 

<t>i(t)  =  2  f  9(l,t-r)  {/(u(0,  t))  -  /(«( 0,  r))  }  dr 
Jo 

+  2  /  9(1,  t  —  t)  {/(u(0,r))  —  /(fi(0,r))}  dr 
Jo 

-  2  /  9(0,  t  -  r){/(u(l,  r))  -  /(u(l, r))  }dr 
Jo 

-2  f  9(0,  t  —  r){/(u(l,r))  —  /(ii(l,r))}  dr 
Jo 


and  thus 


l*i(0l  <  2M  /  \9(l,t  -  r)|  |<^o(r)j  dr  +  [  |tf(l,<  -  r)||/(u(0,r))  - /(u(0,r))|  dr 
Jo  Jo 

+  2M  [  |0(O,  t  —  r)|  \<j>\(r )|  dr  +  !  |^(0,t  -  r)||/(u(l,r))  -  /(u(l,r))|  dr. 

Jo  Jo 

By  assumption  the  range  of  values  of  u  on  the  line  x  =  1  is  contained  in  the  range  over 
x  =  0  for  any  interval  0  <  t  <  T  and  by  the  collocation  assumption  u(0,t)  =  u(0,t)  =  h(t ) 
at  each  of  the  points  tj ,  we  have 

\4>i(t)\<2M  [  \9(l,t -r)\\<f>o(T)\dr  +  f  |0(1,<  -  r)|A/(r)  dr 
Jo  Jo 

+  /  \9(0,t  -  r)\Af(r)dr  +  2M  f  |0(O,<  -  r)|  |^i(r)|  dr. 

Jo  Jo 

Applying  Gronwall’s  inequality  to  the  above  inequality,  noting  that  the  function  0(0,  t)  has 
an  integrable  singularity  at  t  =  0 ,  gives  the  statement  of  the  lemma.  □ 
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Proof  of  Lemma  5:  We  have  the  identity 


u(x,t)-u(x,t)  =  ij>(x,t)-  2  f  0(x,t  -  r)  {/(u(0,r))  -  f(u(0,r))  -  e(t)}  dr 

Jo 

-2  f  Q{x  —  l,t  —  r)  {/(u(l,r))  —  /(u(l,r))}  dr.  (5.2) 

Jo 

Evaluating  on  the  line  x  =  0  gives 


MO  =  -4{/(u(0,  t))  -  /(fi(0,i))  -  £(()}  +  V’(o.i) 

+  5{/(U(l,t)) (»} 

expanding  the  arguments,  and  rearranging  terms,  we  obtain  the  equivalent  expression 
A{f(ii(0,t))-f(u(0,t))  -e(t)}  =  A{f(u(0,t))  -/(u(0,t))}  +  cf>0  -^(0,<) 

+  ${/(u(M))  -/(«(M))  +/(o(l,0)  -/(«(!, 0)} 

or 

/(u(0,  t))  -  /(u(0,  t ))  =  e (*)  +  /(u(0,  f))  -  /(u(0,  t ))  +  1  [<£0  - t/>] 

+  -4-1«S{/(u(l,t))  -/(u(l,t))} 

+  A-1S{f(u(iyt))-f(n(i,t))}. 

Since  .4-1<S  is  an  integral  operator  with  an  integrable  kernel,  we  can  apply  Gronwall’s 
inequality  to  the  function 

/(«( 0,0)  -/(«( o,0) 

and  use  the  range  assumption  that 

[u(l,0),tZ(l,t)J  C  [u(0,0),u(0,<)] 


to  obtain 

|/(a(0,t))  -  /(fi(0,0)|  <  |/(a(0,O)  -  /(8(<M))I  +  l>r1[^o  -  0]|  +  l«(*)l 

+  M_15{/(a(l,<))  -/(“(!,  0)}  |- 


Using  the  uniform  Lipschitz  assumption  on  /,  and  (2.10),  we  obtain 


|/(s(0,f))  -/(a(o,0)|  <  m\M0\  +  M"'W.  -*]|  +  CJ«|K,||„oJM  +  |e(«)|. 
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Taking  the  supremum  of  the  right  hand  side  (over  t ),  we  obtain  the  conclusion  of  the  lemma. 

□ 

Proof  of  Lemma  6:  Since  /  is  piecewise  linear,  with  slopes  Cj  we  have  ||/,||oe)  = 
maxj=i)...iiv{cj}.  Therefore,  since  (/-/-£)'  €  Too  and  (/  —  /  -  e)  =  0  at  t  =  0, 
( ut  —  ut  —  ipt)  6  Cx/2[0,  T] .  By  the  collocation  condition,  u  —  u  —  ip  vanishes  at  the  colloca¬ 
tion  points,  and  by  the  intermediate  value  theorem  the  derivative  vanishes  at  some  point  in 
the  interior  of  each  interval.  This  immediately  yields  the  result 

sup  |it  —  u  —  ip\  <  C  (At)1+a  (5-3) 

0<t<T 

sup  |ut  —  ut  —  ipt\  <  C  (At)0  (5.4) 

0<t<T 

for  any  0  <  a  <  1/2.  Note  that  it  is  only  the  smoothness  of  /  and  the  initial  error 
(propagated  through  ip )  that  limit  the  convergence  rate.  □ 

Proof  of  Theorem  1:  To  prove  Theorem  1,  we  substitute  (3.9)  into  (3.10),  collect  the  terms 
involving  ||A/||  ^  [0  ^  ,  and  sup  over  t  6  [0,T]  with  T  sufficiently  small.  The  estimate 
essentially  follows  from  the  representation  theorem,  and  is  valid  for  any  /  6  C“.  The 
procedure  can  be  continued  for  large  T  by  bootstrapping.  □ 

Proof  of  Theorem  2;  The  fact  that  there  is  a  unique  piecewise-linear  function  /  follows 
from  monotonicity.  We  use  Lemmas  2  and  6  to  obtain  the  stated  bound.  □ 

Proof  of  Theorem  3:  Let  («i,/i)  and  ( U2,/2 )  be  two  solutions  to  (1.1)  -  (1-4).  We  make 
the  assumptions  that  the  data  g(x,t )  and  uo(x)  satisfy  A3  and  A4,  and  assume  that  both 
fi  and  fi  lie  in  Ca  for  a  >  1/2.  This  last  condition  is  to  guarantee  the  existence  of  a 
strong  solution  to  the  direct  problem. 

If  /i  (^(0))  >  fi (h(0))  then  for  some  t'  >  0,  fi(v)  >  f2(v)  for  h(t')  <  v  <  /i(0).  Lemma  3 
then  gives  t<i(x,t;/i)  <  U2(x,f;/2)  for  0  <  t  <  t'  and  in  particular  ui(l,f)  <  u2(l,t)  which 
is  in  violation  of  (1.4),  since  both  these  functions  must  agree  with  the  overposed  data  h(t) 
at  this  point. 

If  /i(/i(0))  —  fi  (h(0)) ,  and  f\  —  fi  does  not  vzinish  identically  in  a  neighborhood  of  h(0), 
then  there  is  a  t'  >  0,  such  that  for  h(t')  <v<  h( 0)  either  f\(v)  >  fi(v),  or  f\{v)  <  fa{y ), 
or  fi(v)  =  /2(f)  +  f(v)  where  /  is  a  Ca  function  with  an  infinite  number  of  zeroes  in  any 
neighborhood  of  h( 0).  In  the  first  two  cases  lemma  3  once  again  gives  a  contradiction  with 
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(1.4).  The  last  case  is  precisely  the  statement  of  theorem  2,  and  the  corollary  is  a  direct 
consequence.  □ 

Note  that  if  the  data  and  /  are  sufficiently  smooth  then  the  solution  u  will  be  C2  in  time, 
globally.  For  /  piecewise  linear,  u  will  be  C2  in  time  on  the  boundary  between  collocation 
points  {t«},  but  not  globally.  (The  discontinuities  of  the  derivatives  of  u  on  the  boundary 
can  occur  only  at  the  collocation  points.) 
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and  continuous  dependence  of  the  initial  distribution  function  on  the  data  are  addressed.  Some 
numerical  simulations  are  presented  to  illustrate  the  feasibility  of  recovery  using  the  methods  of 
the  paper. 
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1.  Introduction. 


In  population  dynamic  models  that  consider  the  age  structure  of  the  species,  we  are  interested  in 
finding  p(a,  t ) ,  the  number  of  individuals  of  age  a  alive  at  time  t .  The  total  population  P(t)  of  the 
species  is  then  given  by  P(t)  =  J"QL  p(a,  t)  da ,  where  L  is  the  maximum  life  span  of  the  species.  The 
first  mathematical  model  along  these  lines  was  proposed  by  Lotka.  In  this  treatment  one  assumes 
as  known,  a  birth  function  0  and  a  death  function  A .  0(a)da  and  A (a)da  are  the  probability  of 
an  individual  of  age  a  giving  birth  or  dying  in  the  interval  a  to  a  +  da.  The  functions  0  and  A 
depend  on  the  age  a  of  the  individual.  Further,  to  incorporate  cooperation  or  competition  between 
individuals  of  the  species,  they  should  be  considered  to  depend  on  the  total  population  P(t)  or 
possibly  on  p(a,  t ) .  In  this  paper  we  restrict  our  attentions  to  the  linear  model. 

One  standard  problem  is  the  following;  given  an  initial  population  with  age  distribution  <f>(a)  = 
p(a ,  0) ,  and  the  functions  0  and  A ,  determine  the  time  evolution  of  the  population  p(a,  t) .  We 
shall  refer  to  this  problem  as  the  direct  problem.  In  the  case  where  A  and  0  are  functions  only  of 
age,  the  models  of  Lotka,  [Lotka,  1956],  and  McKendrick  [McKendrick,  1926]  leads  to  respectively 
a  linear  integral  or  a  linear  differential  equation  to  be  solved  for  the  function  p(a,  t) .  A  nonlinear 
model  where  the  birth  and  death  processes  depend  on  the  total  population  was  introduced  by 
Gurtin  and  MacCamy  [Gurtin  and  MacCamy,  1974],  and  Hoppensteadt  [Hoppensteadt,  1974]. 

In  practice,  it  may  not  be  possible  to  monitor  a  species  sufficiently  closely  in  order  to  obtain  a 
good  approximation  to  some  of  the  parameters  required  for  the  solution  of  the  direct  problem.  In 
a  recent  paper,  [Rundell,  1989],  it  was  shown  that  one  could  recover  an  unknown  birth  function 
from  overposed  data  consisting  of  the  total  population,  a  quantity  that  may  be  much  more  easily 
determined  by  an  experimenter,  over  an  appropriate  time  interval.  Problems  such  as  these  are 
usually  referred  to  as  inverse  problems.  Another  situation  that  can  occur  is  when  one  is  unable  to 
accurately  compute  the  value  of  the  initial  age  structure,  that  is,  when  it  is  difficult  or  impossible 
to  take  a  census.  In  this  paper  we  investigate  the  possibility  of  determining  the  initial  age  structure 
<p(a)  from  a  knowledge  of  the  birth  and  death  functions  and  the  total  population  P(t)  measured 
over  an  interval  of  time  equal  to  a  lifespan  of  the  species.  We  show  that  conditions  can  be  imposed 
on  the  data  that  will  guarantee  a  unique  solution  of  the  inverse  problem,  and  that  the  stability  of 
the  recovery  scheme  depends  on  the  behaviour  of  the  death  function  near  the  maximum  life  span. 
We  also  investigate  the  problem  of  recovering  an  “initial”  age  structure  4>{a) ,  assuming  as  known 
A  and  0,  from  a  knowledge  of  the  age  structure  p(a,T)  at  a  later  time  T.  Thus  we  attempt 
to  recover  unknown  census  data  at  a  previous  time  by  using  information  on  the  birth  and  death 
process  and  census  data  taken  at  the  present. 

We  have  performed  numerical  experiments  to  investigate  the  feasibility  of  the  methods  we  introduce, 
and  some  of  these  are  presented  in  this  paper. 


1 


2.  Notation  and  the  solution  of  the  Direct  Problem. 

We  let  L  denote  the  maximum  life  span  of  the  species,  and  use  p(a,t)  to  represent  the  number 
of  individuals  of  age  a  alive  at  time  t.  We  use  A  =  A(a)  to  represent  the  death  function,  and 
f3  =  /3(a)  for  the  birth  function.  The  differentials  Ada  and  (3da  represent  the  probabilities  of  an 
individual  dying  or  giving  birth  in  the  interval  (a,  a  +  da)  respectively.  The  function 

r(a)  =  exp  j—  A(r)dr}  (1) 

is  called  the  life  table  of  the  species,  and  is  the  probability  of  an  individual  living  to  age  a.  x(a)  is  a 
smooth  monotone  non-increasing  function  of  age,  satisfying  ?r(0)  =  1  and  x(L)  =  0.  B{t)  =  p(0,f) 
is  the  birth  rate  at  time  t  and  can  be  written  in  terms  of  the  birth  function  as 

B(t)  =  f  p(a,t)0(a)da .  (2) 

Jo 

The  initial  age  distribution  <j>(a )  is  given  by 


p(a,0)  =  <t>(a),. 


(3) 


It  will  be  convenient  to  introduce  the  weighted  variables  <j>(a)  =  <j>(a)/x(a),  and  the  net  maternity 
function  b(a)  =  (3(a)x(a) .  It  can  easily  be  shown  that  if  the  life  table,  birth  rate  and  initial  age 
distribution  are  known  then  p(a,t)  can  be  computed  and  is  given  by,  ([Keyfitz,  1968]  or  [Webb, 
1985]) 

p(a  t)=f  *(«)#«  “  0  ^ 

(  x(a)B(t  -  a)  if  t  >  a. 

In  fact  (4)  can  be  used  to  show  that  B(t)  must  satisfy  the  Sharp-Lotka  equation 


(4) 


B(t)  =  i:  b{d)B{t  -  a)da+  Ji  b(a)$(a  -  t)  da 


(5) 


valid  for  t  <  L.  We  also  note  that  if  b  vanishes  on  the  interval  a-i  <  t  <  L  then  we  have  on  this 
interval 


If  t  >  L,  B(t)  satisfies 


B(t)  =  [  b(a)B(t  -  a)  da . 

Jo 

B{t)  =  /  b(a)B(t  -  a) da. 
Jo 


(6) 


(7) 


The  first  integral  in  (5)  represents  the  births  from  those  individuals  born  in  the  interval  (0,  t] ,  while 
the  second  represents  the  births  at  time  t  from  the  initial  population.  Given  0 ,  x  and  <j> ,  this 
linear  Volterra  integral  equation  of  the  second  kind  can  be  uniquely  solved  for  B(t) .  The  total 
population  alive  at  time  t,  P(t)  =  p(a,t)da,  is  given  by  the  relation 


P(t)  =  J  x {a)B{t  -  a)  da  +  J i  x(a)j>(a  -  t)  da 


(8) 
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It  can  be  shown  that  if  /?,  A  and  <$>  are  nonnegative,  continuous  functions  on  [0,  Z-),  then  p(a,t) 
is  continuous  on  [0, t)  and  ( t,L ),  with  at  most  a  jump  discontinuity  at  a  =  t.  From  equation  (4) 
the  function  p(a,t )  will  be  continuous  at  a  =  t  if  and  only  if  <£(0)  =  2?(0)  =  f5(a)4>(a)da.  We 

do  not  impose  this  requirement  on  our  model. 

In  McKendrick’s  equivalent  formulation  of  the  direct  problem  [McKendrick,  1926],  p(a,  t)  is  given 
by  the  solution  of  the  first  order  hyperbolic  equation 

f£  +  ^  +  A(a)p  =  °,  (9) 


with  the  initial-boundary  conditions  formed  by  equation  (3)  and 


(3{a)p(a,t)d.a. 


(10) 


This  approach  will  be  useful  in  the  reconstruction  of  the  initial  distribution  from  the  values  of 
p(a ,  T )  for  some  later  time  T . 


Some  remarks  on  notation. 

The  rth  derivative  of  a  function  f(t)  we  denote  by  f^(t) . 

We  let  £[/]  denote  the  Laplace  transform  of  a  function  and  we  denote  by  f  *  g  the  convolution 

Jo  /(*  ~  3)9(s)ds. 

For  a  function  f(s)  defined  on  the  interval  0  <  s  <  L  we  define  f(s)  =  f(L  -  s). 

The  following  result  is  elementary,  but  will  be  used  sufficiently  often  in  this  paper  to  merit  separate 
attention. 


Lemma  1.  For  any  continuous  functions  k  and  g  defined  on  a  subset  of  [0,  oo)  with  k  not 
identically  zero  there  is  at  most  one  solution  of  the  convolution  equation  k  *  f  s=  g.  If  k  and  g 
have  m  continuous  derivatives  with  k^r\ 0)  =  0  for  r  <  m  while  fc(m)(0)  ^  0,  then  there  exists  a 
continuous  solution  f  of  the  convolution  equation  and  this  solution  depends  continuously  on  the 
functions  k(m)  and  g(m)  in  the  supremum  norm. 

Proof:  For  the  first  part,  note  that  by  extending  as  a  constant  to  all  [0,  oo)  we  can  form  the 
Laplace  transform  of  the  functions  and  obtain  the  relation  C[k\.C[f\  =  C\g\.  If  the  convolution 
equation  had  two  solutions  f\  and  then  their  difference  /  would  satisfy  C[k\.C[f)  =  0.  Since 
the  product  of  the  two  analytic  functions  £[&]  and  £[/]  cannot  vanish  identically  unless  one  of  the 
is  identically  zero,  we  obtain  that  £[/]  =  0  and  hence  /  =  0 . 

For  the  second  part,  if  we  differentiate  the  convolution  m  times,  using  the  fact  that  fc(r)(0)  =  0  if 
r  <  m,  we  have  the  second  kind  Volterra  equation  fc(m)(0)/(t)  +  k^m\t  —  3)f{s)ds  =  g(m)(t). 

A  continuous  solution  to  this  type  of  equation  exists  provided  that  the  kernel  and  right-hand  side 
are  continuous,  [Linz,  1985]. 

This  result  cannot  be  improved  upon,  that  is,  we  cannot  expect  continuous  dependence  of  the 
solution  /  to  the  convolution  equation  k*f  =  g  unless  the  above  conditions  hold.  A  counterexample 
will  be  provided  later  in  the  context  of  the  inverse  problem. 
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Recovery  of  initial  distribution  from  total  population 

We  shall  make  the  following  assumption  on  the  birth  and  death  processes, 

Al.  For  some  positive  integer  m  >  1,  x(a)  and  /3(a)  are  2m+l  times  continuously  differentiable 
on  [0,1.].  0(a)  is  nonnegative  and  has  support  contained  in  the  interval  [01,02]  where 
0  <  ai  <  02  <  L .  x(a)  is  a  monotonically  nonincreasing  function  with  jt(0)  =  1  and  whose 
r*h  derivative  x ^(L)  is  zero  for  0  <  r  <  m,  but  x^m\L)  ^  0. 

These  assumptions  should  be  valid  for  a  wide  range  of  species.  The  behaviour  of  x (a)  at  the 
maximum  life  span  L  will  be  crucial  to  our  analysis.  By  definition  this  function  must  vanish  at 
a  =  L,  but  the  issue  is  whether  its  derivatives  do  likewise.  For  humans  in  a  developed  society  it 
may  very  well  be  that  x^r\L)  =  0  for  all  values  of  r,  but  this  is  unlikely  to  be  true  for  any  species 
for  which  the  aged  are  under  increased  environmental  pressure.  As  we  will  see,  the  larger  the  value 
of  m,  the  more  ill-conditioned  the  inverse  problems  will  be.  In  our  numerical  runs  we  have  used 
m  =  1. 

We  assume  that  the  function  P(t)  is  known  for  0  <  t  <  L  and  satisfies 
A2.  P(t )  is  m  +  2  times  continuously  differentiable  on  the  interval  [0,  L\ 

We  can  rewrite  equations  (5)  and  (8)  in  the  form 


B(t)  =  (b*  B)(t)  +  ^(t) 
P(t)  =  (x*B)(t)  +  V2(t) 


where 


$i(f)  =  b(a)4>(a  —  t)  da 

*2  (0  =  r  x(a)<j>(a  —  t )  da 


(11) 

(12) 


(13) 

(14) 


Lemma  2.  The  functions  and  $2  are  related  by  (t)  =  jf"  7(0  -  t)'®2(a)  da  where  7(f)  is 
the  solution  of 


7r(a  +  t)i(a)  da , 


(15) 


or  equivalently 

6(f)  =  f  #(f  —  0)7(0) da. 
Jo 


(16) 


The  function  7  is  m  times  continuously  differentiable  on  [0 ,L]  and  7  vanishes  identically  on 
[0,  L  -  a2]  • 


Proof:  We  have  $i(f)  =  f*  b(t  -  a)4>(a)da  and  V2(t)  =  /J  f(t  -  a)4>(a)da,  that  is,  =b*4>  and 
$2  =  *  *  4>- 
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Now 


[  7(<  -  a)4,2(-s)^  =  f  7(*  -  •»)  /  #(•»  -  a)j>(a)  dc  ds 
Jo  Jo  Jo 

-a:  7 (t  —  s)f(s  —  a)  ds  4>{a)  da 


—  I  I  7  (t  -a  -  s)ir(s)  ds  #a)  da 
Jo  Jo 

=  f  h(t  -  a)4>(a)  da 
Jo 


=  (0 


where  we  have  used  (16).  This  gives 


¥i(0  =  /  i(t  -  s)$2(s)  ds 

Jo 

=  [  +  °  —  L)'V'i(a)  da 

JL-t 

or 

7(a-*)^2(a)da 

Note  that  since  t(T)  =  0,  a  differentiation  of  (15)  will  lead  to  another  Volterra  integral  equation 
of  the  first  kind  for  7.  Since  ic(m\L)  ^  0,  m+  1  differentiations  will  convert  this  to  a  second  kind 
Volterra  equation 

b<m+1\t)  =  {-l)m  frm\L)~,(L  -  t)  +  £  Vm+1)(a  +  f)7(«)<to}  (17) 

from  which  7  can  be  uniquely  recovered.  The  fact  that  7(f)  is  zero  for  0  <  t  <  L  -  02  follows 
from  assumption  A1  and  (16). 

This  completes  the  proof  of  the  lemma. 

From  (11)  and  (12)  we  obtain 

ir*B  =  ir*b*B  +  ir*’9\ 
b*  P  =  b*v  *  B  +  6*^2 


and  hence 


*2  -  &  *  *2  +  *  *  *i  =  P  -  b*  P. 


Using  lemma  2  and  some  amount  of  rearrangement  we  can  write  this  as 


where 


¥2(0+  /  K(t,a)92(a)da  =  d(t) 

Jo 

d(t)  =  P{t)~  f  b(t  —  a)P(a)  da 
Jo 
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and 


\  _  /  io1  T(*  “  a  +  3)i(s)ds  -  b(t  -  a)  a  <  t, 

(  ’ 0  I  /“-t  »■(<“«  +  s)7(5)ds  a  >  <• 

The  continuity  assumptions  on  7  and  x  show  that  K(t,a)  is  m+1  times  continuously  differentiable 
in  a  and  t  for  t  £  a. 

For  a  <  t  and  1  <  r  <  m  +  1  we  have 

* (r)(t  -a  +  s)y(s)  ds  -  b(r)(t  -  a) 


Consequently, 


(19) 


For  a  >  t  and  l<r<m+l  we  obtain  by  using  the  properties  of  7  and  its  derivatives  at  t  =  0 
that 

drK 
dr 


—  =  V]  xA:(0)7(r-1-*)(a  -  t)  +  [  x(r)(t  -  a  +  3)y(s)ds 


and  hence 

lim.  4“  =  /  *{r)(s)y{s)ds.  (20) 

a-*t+  air  Jo 

Equations  (19)  and  (20)  now  show  that  K(t,a)  and  its  partial  derivatives  up  to  order  m  +  1  are 
continuous  for  0  <  t  <  L . 

The  operator  K,f  =  /QL  K(t,  a)f(a)da  is  compact  and  thus  has  only  a  finite  number  of  eigenvalues 
in  the  exterior  of  any  neighborhood  excluding  the  origin.  Furthermore  1C  is  neither  symmetric 
nor  definite.  If  the  value  —1  is  not  in  the  spectrum  of  the  operator  then  there  will  be  a  unique 
solution  to  (18).  This  condition  depends  only  on  the  birth  and  death  functions.  It  is  difficult 
to  give  conditions  on  these  functions  that  will  guarantee  the  solvability  of  (18).  Numerically  one 
can  compute  the  spectrum  of  /C,  in  particular  the  eigenvalue  with  largest  negative  real  part.  For 
all  reasonable  functions  x  and  (3  that  we  investigated,  this  eigenvalue  had  significant  complex 
component.  We  will  therefore  assume  that 

A3.  The  functions  x  and  (3  are  such  that  the  spectrum  of  1C  does  not  contain  the  point  -1. 

We  now  show  how  to  recover  <j>.  The  regularity  assumption  A2  shows  that  d(t)  is  m  +  1  times 
differentiable.  Thus  a  solution  of  (18)  will  possess  m  +  1  continuous  derivatives  in  view  of  what  we 
have  shown  about  K(t,a) 

By  differentiating  (14)  m  +  1  times  and  using  the  fact  that  n^r\L)  =  0  for  r  <  m,  we  obtain 


or 


'P(2m+1'(t) 


=  (-l)m{x<m>(I)#£  -  0  +  £  1  x<m+1>(a  +  t)<j>(a)  da | 


$<m+1)(Z  -  t)  =  -x(m>(Z)<£(0  +  x(m+1)(I  -  t  +  a)<£(a)  da 


(21) 
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From  assumptions  A1  it  follows  that  the  second  kind  Volterra  equation  (18)  has  a  unique  continuous 
solution  4>  on  [0,  X]  that  depends  continuously  on  the  functions  a-ad  7r(m+1)(a)  in  the 

supremum  norm.  From  (18)  and  Lemma  1,  this  means  that  <j>  depends  continuously  on  the  functions 
7r<m+1)(a),  /?(m+1)(t)  and  P<m+1>(0  ,  in  the  supremum  norm. 

We  thus  have  shown 

Theorem  1.  If  the  birth  function  and  life  table  satisfy  assumption  Al,  A2  and  A3,  then  there 
exists  a  unique  initial  age  distribution  0(a)  which  gives  rise  to  an  observed  total  population  on  the 
interval  [0,  L] . 

Remarks: 

1.  The  assumptions  Al  -  A3  are  not  sufficient  to  guarantee  that  the  initial  function  0(a) 
will  be  nonnegative,  which  is  the  only  admissible  class  of  such  functions.  From  a  practical 
standpoint  this  is  not  a  serious  objection  since  any  physically  observed  P(t)  must  come  from 
a  nonnegative  initial  distribution. 

2.  From  the  above  it  is  clear  that  one  must  provide  the  values  of  the  overposed  data  on  an 
interval  of  length  L .  If  this  interval  is  in  [T,  L  +  T]  then  this  analysis  shows  that  one  can 
recover  p(a,  T ) .  When  T  =  0  this  recovers  the  initial  population  structure  p(a,  0)  =  4>{a)  • 
In  the  next  section  we  will  discuss  the  problem  of  determining  0(a)  from  overposed  data 
P(a,  T). 


Recovery  of  initial  distribution  from  census  data 

Here  we  assume  that  the  functions  x(a),  /?(a)  are  known  as  well  as  the  function  0(a)  which  is 

equal  to  p(a,T )  for  some  fixed  T  >  0. 

For  this  inverse  problem  we  will  require  slightly  different  conditions  on  the  birth  and  death  processes. 

Bl.  For  some  positive  integer  m  >  1 ,  x(a)  and  /3(a)  are  m  + 1  times  continuously  differentiable 
on  [0,  X] .  7r(a)  is  a  monotonically  nonincreasing  function  with  x(0)  =  1  and  ir(L)  =  0. 
/3(a)  is  nonnegative,  has  support  contained  in  the  interval  [01,02]  where  0  <  ai  <  02  <  L, 
and  /?<ml(a2 )  5^  0. 

For  the  overposed  data  0(a),  we  require 

B2.  0(a)  is  an  m  +  1  times  continuously  differentiable  function  on  [0,T)  and  on  (T, L).  It 
has  at  most  a  jump  discontinuity  at  a  =  T  in  the  case  that  T  <  L.  If  T  <  then  the 
compatibility  condition  0(r>(O)  =  (3(a)for\a)da  +  f3(a)rj>(r\a)da  holds. 

Note  that  we  are  imposing  a  boundary  condition  of  the  support  of  the  birth  function,  rather  than 

on  the  death  function  as  we  did  in  the  previous  section. 

We  first  show  that  we  can  construct  the  birth  function  B(t)  over  the  range  0  <  t  <  T . 

In  the  case  T  <  L  we  have  from  (4)  that 


0(a)  = 


x(a)0(a  -  T) 
x(a)B(T  -  a) 


if  a  >  T, 
if  a  <  T, 


(22) 
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and  thus  knowledge  of  ip{a)  implies  that  we  may  simply  read  off  the  values  of  4>{a)  for  the  range 
0  <  a  <  L  -  T .  To  recover  the  initial  distribution  for  ages  a  >  L  -  T  we  must  use  the  information 
provided  by  0  to  obtain  the  birth  function  B(t)  over  the  range  0  <  t  <  T .  We  assume  that  B  is 
continuous  at  t  =  0 ,  so  that  5(0)  can  be  recovered  by  its  limit  as  t  — *  0 .  The  occurence  of  the 
term  ir(a)  in  equation  (22)  suggests  the  utility  of  working  with  defined  by  =  t^(a)/i r(a). 
We  illustrate  the  above  ideas  in  Figure  1  below. 


4>(a) 


The  Sharp-  Lotka  equation  gives  for  T  <  t  <  L 


b(t  -  a)B(a)  da  + 


/ 


a)B(a)da+  /  b(a)d>(a  —  t)  da 


and  since  JtL  b(a)4>(a  —  t)da  =  /0L_<  b(a  +  t)rj)(a  +  T)da  is  known  for  T  <  t  <  L  we  have  that  B(t) 
satisfies  the  equation 


B(t)  -  j  b(t  -  a)B(a)da  =  ai(t)  for  t  >  T  (23) 

where  <*i(0  =  J(L  b(a)i)(T  +  a  -  t)da  +  J*_T  b(a)i){T  —  t  +  a)da  is  defined  for  T  <  t  <  L  and  can 
be  computed  from  tt,  0  and  i>(a). 

By  lemma  1,  equation  (23)  can  be  solved  uniquely  for  the  function  B(t)  for  t  >  T  and  the  continuity 
of  ai(t)  shows  that  this  solution  will  be  continuous  on  (T,LJ. 

From  the  fact  that  6(0)  =  0,  we  can  show  that 


T  Li 

lim  B^(t)  =  lim  a\r\t)  =  (-l)r {  f  (3(a)&r\a)da  +  f  0(a)foT\a)da} 
t—T+  t->T+  Jo  Jj 
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From  (22)  it  follows  that 


lim_  fl<r>(t)  =  (-l)r0(r)(O) 

From  the  Sharp-Lotka  equation  it  follows  that  if  0  is  continuous  (actually  even  if  0  is  piecewise 
continuous)  then  the  birth  rate  B(t)  must  have  the  same  differentiability  as  the  net  maternity 
function  b(t) .  Thus  the  condition 


T  L 

0(r)(O)  =  [  /3(a)flr\a)da  +  [  /?(a)0<r>(a)da 
Jo  Jt 

on  the  overposed  data  is  necessary  if  it  actually  arises  from  a  physically  reasonable  initial  distribu¬ 
tion  and  a  smooth  birth  function. 

Suppose  now  that  nL  <  T  <  (n  +  1  )L  for  n  >  1 .  Then  0(a)  =  p(a,  T )  being  given,  implies  that 

B(t)  is  known  for  T  —  L  <  t  <  T.  We  show  this  implies  that  B(t)  can  be  recovered  over  the  range 

max{0,  T  —  2 L)  <  t  <  T  —  L .  For  t  >  £, 


B(t)  =  [  b(a)B(t-a)da 

Jo 

=  f  b(t-a)B(a)da 
Jt-L 

fT~L 

=  /  b(t  -  a)B(a)  da  +  /  b(t  -  a)B(a)  da . 

Jt-L  Jt-l 


If  t  >  max{L,T  -  L}  then  both  B(t)  and  J^_Lb(t  -  a)B(a)da  are  known.  This  implies  that  the 
function  a2(t)  which  is  equal  to  their  difference  is  known  for  the  range  max{Z.,T  —  L)  <  t  <  T. 
This  leads  to  the  convolution  equation 


b(t  -  a)B(a)da 


(24) 


which  can  be  solved  uniquely  for  B{t )  for  t  in  the  range  max{0,T  -  2X}  <  t  <  T  -  L.  By 
continuing  this  argument  we  can  reconstruct  the  birth  function  B(t)  over  the  range  0  <  t  <  T. 

Note  however  that  each  time  we  compute  B(t )  over  an  interval  T  —  (n  +  1  )L  <  t  <  T  —  nL  from 
previously  computed  values  on  the  interval  T-nL<t<T-(n+  1  )L  we  must  solve  a  first 
kind  convolution  integral  equation  whose  kernel  vanishes  to  order  m  at  a  =  0.  Clearly  the  more 
times  we  have  to  solve  this  ill-posed  problem  the  more  inaccurate  our  knowledge  of  the  function 
B(t)  will  be.  Thus  we  should  expect  that  if  T  <  L  then  recovery  of  B(t),  which  entails  solving 
the  second  kind  equation  (23),  should  be  straightforward  and  the  solution  obtained  should  depend 
continuously  on  the  data  ir,  (3  and  0.  If  T  >  nL  then  we  must  solve  n  first  kind  equations  of 
the  forru  (24)  in  order  to  recover  B(t )  for  0  <  t  <  L.  Thus  the  longer  we  wait  to  take  the  census 
that  provides  the  values  of  0(a),  the  poorer  we  should  expect  any  reconstruction  of  the  initial 
distribution  to  be,  and  that  a  significant  difference  should  occur  if  T  is  chosen  to  be  greater  than 
the  lifespan  L ,  and  a  further  degradation  should  be  encountered  as  T  is  allowed  to  exceed  each 
subsequent  generation. 
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Having  recovered  the  function  B(t)  for  ail  t  >  0,  the  Sharp- Lotka  equation  gives  the  following 
representation  for  <j>  on  0  <  t  <  L 

b(t  -  a)B(a)  da  =  03 (t). 

This  can  be  rewritten  as  the  convolution  equation  f0‘  b(L  —  1 +  s)4(s)  ds  =  a2(L  —  t)  and  since  0 
vanishes  identically  on  the  interval  [a2,L\  this  is  equivalent  to 

t-(L-a2) 

b(L  -  t  +  s)4>(s)  ds  =  az(L  -  t)  (25) 

from  which  it  follows  that  there  is  at  most  one  function  4>  defined  on  the  interval  [0,02)  that  solves 
the  inverse  problem.  A  necessary  condition  for  a  bounded  solution  of  (25)  is  that  the  function  a3(f) 
vanish  with  its  m  +  1  derivatives  at  the  point  t  =  a% .  This  is  a  further  condition  on  B(t)  which 
corresponds  to  a  condition  on  the  overposed  data. 

As  in  the  previous  problem  we  can  convert  (25)  to  a  second  kind  equation  by  m+1  differentations. 

-6<m>(a2)0(O  +  f  &(m+1>(a2  -  t  +  3)4(3)  ds  =  <4m+1)(a2  -  t)  (26) 

Jo 

and  obtain  the  uniqueness  of  a  continuous  solution  0(a) .  Here  the  fact  that  0(m\a2)  /  0  is  crucial. 
Once  again  we  have  not  given  any  conditions  that  guarantee  a  nonnegative  solution. 

In  the  case  that  T  <  L  we  may  utilize  the  information  that  <£(a)  =  4(a  +  T)  for  a  <  L  —  T  and 
rewrite  (25)  as 

t  _  rL-T 

b(a2  - 1  +  s)4>(s)  ds  =  a3(a2  - 1)  -  /  b(a2  —  t  +  s)4(a  +  T)da  f orL  —  T<t<a2  (27) 

,-T  Jo 

The  above  analysis  demonstrates  the  recovery  of  the  initial  age  distribution  for  the  range  0  <  a  < 
a2 .  What  about  the  interval  (a2,  L)?  If  T  >  L  -  a2  then  we  can  never  recover  these  values.  Those 
individuals  in  the  initial  population  whose  age  is  greater  then  a2  do  not  participate  in  the  birth 
process  and  have  all  died  within  a  time  period  of  length  L  —  a2. 

Given  the  remarks  following  lemma  1,  we  would  not  expect  that  the  initial  data  4(a)  would  depend 
continuously  in  the  supremum  norm  on  the  function  0(a)  or  any  of  its  derivatives.  That  this  is 
so  can  be  seen  by  taking  0„(a)  =  \/nx[0, 1/n],  and  considering  the  values  it  imposes  on  the  final 
data  0n(a)  =  p(a,L)  at  a  time  equal  to  the  lifespan.  We  assume  that  0  and  x  are  C°°  functions. 

0n(t)  =  b(a  +  t)4>n(a)da,  and  thus  for  any  integer  r 


f  ■ 


b(t  +  a)4(a)  da 


= B(t)  ~  r 

Jo 


#(nr)(0=  \L  1  fr(r)(o  +  t)<j>n(a)  da 

Jo 
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and  thus 


sup  <  4f  sup  6<'>(a). 

V  n  0 <a<L 


0  <t<L 


For  any  r,  the  function  ^n\t)  tends  to  zero  in  the  supremum  norm  as  n  — *■  oo,  but  4>n  does  not 
tend  to  a  continuous  function.  The  Sharp- Lotka  equation  gives  Bn(t)  =  b(a)Bn(t  -  a)da  -I-  ^,,(1) 
and  by  lemma  1  we  see  that  Bn\t)  will  also  tend  to  zero  as  n  -»  oo  for  any  r.  Since  V’n(o)  = 
p(a,  L)  —  x(a)Bn{L  -  a)  this  shows  that  the  final  data,  tM°)»  and  ^  *ts  derivatives  can  tend 
uniformly  to  zero  as  n  — ►  oo,  yet  the  initial  distribution  fails  to  converge  to  a  continuous  function. 


Some  numerical  experiments 

In  order  to  demonstrate  the  feasibility  of  recovering  the  initial  age  distribution  from  <j>,  A  and 
either  P(t)  or  p(a,  T) ,  we  give  the  results  of  some  numerical  experiments.  By  this  means  we  hope 
to  show  the  extent  to  which  numerical  recovery  of  the  initial  age  distribution  can  be  recovered  and 
some  of  the  complicating  factors  in  the  process. 

It  was  assumed  that  the  birth  function  /3  and  the  life  table  function  x  were  known.  The  maximum 
life  span  L  was  chosen  to  be  10.  We  used  a  life  table  x(a)  as  shown  in  figure  2.  This  was 
constructed  by  taking  a  cubic  function  passing  through  the  points  (0,1),  (1,0-8),  (7,0.3)  and 
(10,0).  If  desired,  a  death  function  A(a)  can  then  be  obtained  by  setting  A(a)  =  x'(a)/x(a).  We 
used  two  slightly  different  birth  functions  (3\  and  /32  as  we  show  in  figure  3.  The  first  of  these 
satisfies  the  conditions  of  Al,  and  is  a  a  twice  continuously  differentiable  function  on  [0,  L\  with 
support  on  the  interval  [2,8]  The  function  /?2  is  similiar  except  it  has  a  discontinuity  in  its  first 
derivative  at  a  =  /?2  as  required  by  Bl. 

In  order  to  procure  data  for  the  inversion  process  we  took  an  initial  function  <£oct(a )  as  shown  in 
figure  2  and  solved  the  direct  problem  numerically  to  obtain  either  the  value  of  the  total  population 
P(t)  over  an  interval  [0,  L\,  or  p{a,T )  for  some  fixed  time  T  >  0.  This  overposed  data,  as  well  as 
/3(a)  and  x(a)  were  passed  to  the  inversion  routines  to  recover  <t>(a). 

Fig  2.  Life  Table  and  Actual  Initial  Distribution 
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Fig  3  Birth  Function* 


nr 


These  functions  are  not  meant  to  represent  actual  values  for  any  particular  species,  but  they  contain 
some  of  the  features  common  to  a  large  class  of  population  structures. 

Some  details  on  these  numerical  procedures  will  now  be  given. 

For  the  first  numerical  test,  we  used  the  birth  function  (3\(a ),  solved  the  direct  problem  using 
4>act(a)  to  obtain  a  total  population  function  P(t)  at  a  discrete  series  of  gridpoints  tj  on  the  time 
interval  0  <  t  <  L.  A  plot  of  the  function  P(t)  resulting  from  these  values  of  t,  (}\  and  <j>act  is 
shown  in  figure  4. 

Fig  4  Plots  of  the  Overposed  Data 


The  values  of  P(ti)  were  then  passed  to  our  inverse  solving  code  to  attempt  to  recover  the  values 
of  <f>.  To  achieve  this  we  recovered  in  succession  the  functions  7,  $2  and  finally  <f>.  For  the  first 
and  third  of  these  we  must  solve  a  first  kind  Volterra  equation  (16),  or  convert  it  to  second  kind  by 
differentation,  and  for  the  computation  of  'Pj  we  need  to  solve  a  second  kind  Fredholm  equation 
(18). 

Where  do  the  difficulties  lie?  Throughout  we  assume  that  both  t  and  /?  are  able  to  be  computed 
exactly.  Thus  in  theory  we  can  solve  for  7  to  within  any  precision  demanded.  Numerically,  the 


12 


question  is  whether  to  solve  the  Volterra  equation  as  a  first  kind  equation,  (16),  or  second  kind 
form,  (17).  The  conventional  wisdom  is  to  take  the  former  approach,  especially  if  the  evaluation 
of  the  kernel  and  free  term  may  be  subject  to  error.  While  we  have  assumed  this  is  not  the  case 
here,  there  is  another  factor  to  consider.  Virtually  all  algorithms  for  solving  first  kind  Volterra 
equations  of  the  form  k(x,  t)f(t)dt  =  g(x)  assume  that  the  kernel  k(x,  t)  does  not  vanish  on  the 
diagonal  x  =  t .  Violation  of  this  condition  leads  to  instabilities.  Rather  than  obtaining  increased 
acccuracy  with  decreasing  stepsize,  this  case  gives  increase  in  accuracy  with  diminishing  stepsize 
only  for  sufficiently  large  stepsize.  As  the  stepsize  is  further  reduced  the  solution  loses  accuracy 
rapidly.  This  phenomenon  is  typical  of  ill-posed  problems.  In  our  problem  if  we  use  equation  (16), 
k(x,t)  =  x(L  -  x  +  t),  and  the  non- vanishing  condition  is  violated.  In  fact  k(x,x)  =  0  for  all 
i .  We  therefore  took  the  compromise  approach  of  repeatedly  differentiating  the  equation  until  the 
resulting  kernel  had  a  non-vanishing  diagonal,  but  stopping  short  of  the  full  conversion  to  second 
kind  which  would  require  differentiating  the  equation  one  more  time.  For  our  situation,  since  we 
are  assuming  that  m  =  1  in  Al,  we  therefore  differentiate  equation  (16)  once  and  determine  7  as 
the  solution  of 


ir'(L  —  t  +  0)7(0)  da . 


(28) 


Here  the  kernel  satisfies  k(t,  t)  =  tt'(L)  ^  0.  The  value  of  n'(L)  is  approximately  —0.2.  We  solved 
the  above  equation  using  a  discretization  method  based  on  the  midpoint  rule,  [Linz,  1985].  For  the 
second  stage  we  must  recover  $2  as  the  solution  to  (18).  We  solved  this  equation  by  the  Nystrom 
method  [Baker,  1979],  which  converts  the  integral  equation  to  a  linear  system  of  equations.  The 
kernel  of  the  integral  equation  depends  only  on  it  and  7,  and  with  our  assumptions  we  should 
be  able  to  evaluate  this  accurately.  The  free  term  d(t)  depends  on  x,  0  and  also  the  measured 
population  P(t).  Depending  on  the  quality  of  this  data  we  may  or  may  not  be  able  to  accurately 
compute  the  free  term.  Any  inaccuracies  in  this  will  be  magnified  by  the  condition  number  of 
the  matrix  of  the  system.  Of  course,  if  there  is  an  eigenvalue  of  this  matrix  near  the  value  —  1 
then  this  condition  number  could  be  very  large.  We  investigated  a  wide  selection  of  reasonable 
values  for  the  birth  and  death  functions  and  in  no  case  was  there  a  problem  with  the  matrix 
inversion.  For  the  values  of  tt  and  /?  shown  above  the  resulting  matrix  has  condition  numbers  of 
approximately  9.5  for  the  case  of  (3\  and  2.8  for  the  case  of  /?2  •  Thus  the  error  made  in  the  recovery 
of  rj)  should  be  of  a  similar  order  of  magnitude  to  the  error  in  the  observed  P(t).  There  are  still 
some  difficulties  however.  It  follows  from  equation  (14)  that  the  function  $ 2 (0  should  satisfy  the 
necessary  conditions  ^(T)  =  =  0  f°r  recovery  of  a  bounded  initial  distribution  4>(a).  From 

the  integral  equation  (18),  since  K(L,a)  =  Kt(L,a )  =  0,  it  follows  that  the  function  d(t)  should 
satisfy  d(L)  =  d'(L)  =  0.  Given  errors  in  P{t)  this  will,  in  general,  not  be  true.  Thus  although  the 
magnitude  of  the  error  on  the  computed  values  of  $2  may  be  small,  and  the  average  value  of  the 
relative  error  small,  the  relative  error  in  the  vicinity  of  the  point  t  —  L  may  be  considerable.  This 
will  significantly  affect  the  computation  in  the  third  stage.  We  must  solve  equation  (14),  and  once 
again  we  are  faced  with  the  question  of  whether  to  differentiate  this  equation  prior  to  numerical 
solution,  and  if  so,  how  many  times.  The  issues  are  slightly  different  from  those  encountered  in  the 
first  stage.  The  kernel  is  identical  in  both  cases,  but  whereas  the  free  term  in  (16)  was  assumed 
to  be  known  to  high  accuracy,  this  may  not  be  the  case  with  equation  (14).  Since  errors  made  in 
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solving  a  Volterra  equation  for  small  values  of  t  will  always  contribute  to  the  solution  for  later  t , 
it  is  imperative  to  have  good  starting  values.  However,  the  values  of  <j>{a)  for  a  small  will  depend 
on  the  values  of  $2  near  L ,  which  is  precisely  the  place  where  this  function  has  the  largest  relative 
error.  This  factor  gives  the  greatest  contribution  to  the  error  in  effectively  computing  the  initial 
distribution  i>. 

With  sufficiently  accurate  data  on  P{t )  we  were  able  to  recover  the  values  of  the  function  <j>  to 
within  any  desired  accuracy.  We  then  ran  the  inverse  code  with  a  value  of  P(t)  that  had  a  certain 
percentage  of  noise  added.  To  achieve  this  we  added  normally  distributed  random  values  to  the 
(accurate)  overposed  data  computed  by  the  direct  solver.  This  noisy  data  was  then  smoothed  by  a 
smoothing  spline  routine  before  being  passed  to  the  inverse  solver.  In  figure  5  we  show  the  results 
for  1%  and  3%  noise  in  the  overposed  data  P(t) .  The  function  4>act  is  shown  as  a  dashed  curve. 

Fig  5.  Computed  <f>  with  noise  in  P(t) 


We  also  did  similar  computations  for  the  second  inverse  problem.  Results  are  shown  in  figure  6  for 
5%  and  10%  error  in  the  overposed  data.  We  used  02  48  the  birth  function,  and  used  a  value  of  6 
for  the  final  time  T.  The  results  are  shown  in  figure  6 

Clearly,  the  second  problem  is  less  sensitive  to  noise  in  the  overposed  data  than  the  first.  There  are 
several  reasons  for  this.  First,  we  are  computing  the  function  <f>(a)  over  a  shorter  range,  4  <  a  <  8 
in  this  case.  Second,  we  have  to  solve  one  first  kind  and  one  second  kind  Volterra  equation  when 
p(a,  T )  is  the  overposed  data.  The  overposed  data  only  appears  in  the  free  term  of  the  first  of  these 
equations,  (26).  The  values  obtained  from  this  equation  are  used  to  compute  the  free  term  for  the 
second  equation  (23).  In  both  equations  the  kernels  can  be  computed  directly  in  terms  of  ir  and  0 . 
In  contrast,  when  P(t)  is  the  overposed  data,  the  solution  of  a  first  kind  Volterra  equation  is  used 
to  compute  the  kernel  of  a  second  kind  Fredholm  equation,  and  the  result  used  in  another  first  kind 
equation.  Thus  the  errors  tend  to  amplify  with  each  sucessive  computation,  and  there  are  simply 
more  of  them  (and  more  ill-posed  ones)  in  the  first  Inverse  problem.  Third,  although  (27)  is  first 
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Fig  6.  Computed  4>  with  noise  in  p(a,  T ) 


kind  with  the  kernel  k(t,a)  such  that  k(t,  t)  =  0,  when  we  differentiate  this  equation  the  resulting 
equation  has  a  kernel  that  does  not  vanish  on  the  diagonal,  and  in  fact  for  the  function  /? 2  we  have 
/ 32(a2 )  =  —0.8,  which  is  a  relatively  large  value,  and  allows  an  accurate  and  stable  scheme  to  be 
used  in  the  recovery  of  <j>(a) . 

What  would  happen  if  we  used  /32  in  the  recovery  of  <j>  with  P(t)  as  the  overposed  data  or  the 
function  (3\  when  p(a,  T )  was  overposed  data? 

Ix»  the  analysis  of  section  2  we  assumed  that  the  birth  function  had  2m  +  1  continuous  derivatives, 
which  in  our  examples  requires  that  (3  be  thrice  continously  differentiable  on  [0,  L ] .  This  is  true 
of  (3\ ,  but  certainly  not  the  case  for  /?2  which  has  a  discontinuous  derivative  at  a  =  a2 .  To  test 
how  lack  of  regularity  in  the  birth  and  death  processes  affects  the  numerical  recovery  of  4>{a)  we 
ran  the  above  codes  with  accurate  values  of  P(t)  but  using  /?2  in  place  of  /?i .  The  values  obtained 
had  no  relation  to  <f>act  whatsoever.  A  similar  result  was  obtained  when  we  used  /?i  in  the  second 
inverse  problem. 

The  explanation  for  this  is  as  follows.  With  the  function  ,  (which  was  built  in  such  a  manner  to 
have  3  continuous  derivatives)  we  had  a  continuous  free  term  in  equation  (15),  which  was  used  to 
recover  the  function  7 .  With  /?2  the  free  term  is  discontinuous  and  mathematically  one  must  expect 
the  same  to  be  true  of  the  7  obtained  as  a  solution.  This  has  significant  effects  on  the  accuracy  of 
the  quadrature  schemes  used  to  solve  the  Fredholm  equation  (18)  the  kernel  of  which  depends  on 
integrals  of  7.  Thus  even  although  we  may  compute  the  free  term  d(t)  (depending  on  x  and  P) 
accurately  there  will  be  numerical  errors  in  the  evaluation  of  the  kernel  of  this  equation.  Indeed 
K(t,a)  will  not  be  continuously  differentiable  in  t  and  therefore  neither  will  be  the  solution  'P2 . 
Once  again  we  will  have  errors  in  $2 ,  and  these  errors  will  manifest  themselves  as  oscillations  due 
to  behaviour  of  the  derivative.  When  we  attempt  to  recover  <f>{ o)  from  (14)  it  may  in  fact  be  better 
to  solve  this  first  kind  equation  without  prior  differentiation  since  we  prefer  not  to  differentiate  the 
free  term  ^2 .  However,  without  differentation,  the  numerical  schemes  for  solving  the  first  kind 


15 


equation  will  be  limited  to  low  accuracy  by  the  restriction  on  the  stepsize.  In  practice  we  found 
that  when  we  replaced  0\  by  a  virtually  identical  function  except  that  we  joined  its  component 
parts  so  that  only  the  first  2  derivatives  were  continuous,  we  were  able  to  effectively  recover  <f>(a) 
from  accurate  overposed  data.  However,  for  a  given  set  of  grid  sizes  the  error  in  computing  <f>  was 
more  than  ten  times  the  error  in  computing  the  initial  data  with  0\ .  From  a  numerical  as  well 
as  a  mathematical  standpoint  the  regularity  assumption  Al  on  the  birth  and  death  functions  are 
therefore  necessary. 

As  noted  above,  we  were  unable  to  recover  4>{a)  from  p(a,  T)  when  0\  was  the  birth  function.  The 
condition  0'(a2)  ^  0  is  essential  for  the  stability  of  the  numerical  schemes.  In  order  to  determine 
<t>  from  x ,  ,  and  p(a,  T )  we  modified  our  procedure  slightly. 

There  are  two  different,  but  closely  related,  possibilities  in  dealing  with  the  fact  that  both  0\ (a2) 
and  («2 )  are  zero.  First,  one  could  modify  the  function  0\  in  a  small  neighborhood  of  the  point 
a  =  a2  so  that  it  was  continuous  on  [0,  L\ ,  twice  continuously  differentiable  on  [0,a2),  and  the 
derivative  at  a  =  a2  was  non  zero.  Second,  one  could  retard  the  argument  of  0\  in  equation  (27), 
effectively  solving  this  equation  with  the  value  of  a2  replaced  by  a2  —  e  in  both  the  kernel  and 
free  term.  This  gives  k(t,  t)  =  b(a2  -  c)  which  is  now  nonzero  for  e  >  0 .  How  is  €  chosen,  or 
the  modification  near  a  —  a2  selected  for  maximum  performance?  How  would  you  know  you  had 
obtained  such  a  maximum?  One  possible  method  is  to  fine  tune  the  choice  of  parameters  using 
known  solutions.  The  result  obtained  by  retarding  the  argument  in  the  kernel,  choosing  c  =  0.3 , 
is  shown  in  figure  7. 

Fig  7.  Computed  <f>  using  modified  /32 


The  relatively  large  errors  near  the  point  L  —  T  =  4  are  due  to  inaccuracies  in  the  Volterra  solver 
from  inaccurate  evaluation  of  the  kernel  (which  depends  on  6(a))  near  the  point  a  =  a2  where 
0(a)  was  modified.  Of  course,  we  could  smooth  the  function  <f>  near  this  point,  using  the  accurately 
determined  values  for  a  <  L  —  T  and  the  reasonable  well  computed  values  for  a  larger  than  and 
away  from  L  -  T .  In  addition,  we  could  modify  the  result  to  guarantee  a  postive  solution.  These 
would  significantly  improve  the  results  obtained. 
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1.  Introduction. 

This  paper  considers  the  problem  of  determining  the  unknown  coefficient  A  in  the  first  order 
hyperbolic  equation 

Pt(a,  f)  +  pa(a,t)  +  Xp(a,t)  =  0.  (1.1) 

Equations  such  as  (1.1)  are  ubiquitous,  but  our  choice  of  independent  variables  indicates  a  particular 
application  we  have  in  mind;  a  stands  for  age,  t  for  time  and  the  density  p(a,t )  denotes  the  number 
of  individuals  of  age  a  at  a  particular  time  t .  We  assume  A  >  0  and  in  this  case  A  denotes  a 
death  function,  Xda  being  the  probability  of  a  death  occurring  to  a  given  individual  in  the  time 
period  a  to  a  +  da.  Since  this  quantity  should  depend  on  the  age  of  the  individual  A  depends 
on  a,  and  in  the  case  of  competition,  should  also  depend  on  p(a,t).  This  dependence  may  be  in 
fact  be  on  the  total  population  at  time  t,  P(t )  =  J  p(a,t)da ,  and  there  certainly  are  circumstances 
where  A  =  A(a,  P )  is  the  appropriate  model.  While  we  are  using  the  language  of  age  structured 
population  dynamics,  our  “  species”  could  be  also  be  mechanical  parts,  p(a,t)  would  denote  the 
number  of  parts  of  age  a  still  in  service  at  time  t ,  and  A  would  be  age  dependent  rate  of  failures. 

In  addition  to  (1.1)  we  would  normally  be  given  initial-boundary  data;  typically  the  initial  age 
distribution  p(a,  0)  together  with  information  on  the  boundary  a  =  0.  This  may  be  the  actual 
value  of  p(0,f),  (the  number  of  new  parts  put  into  service  at  time  t)  or  an  equation  involving  this 
quantity,  such  as  p(0,t)  =  f  0(a)p(a,t)da.  The  latter  case  of  a  non-local  boundary  condition  is 
the  usual  one  in  population  dynamics,  0(a)  is  the  birth  function  and  0da  is  the  probability  of  an 
individual  of  age  a  giving  birth  in  the  interval  a  to  a  -I-  do.  Thus  the  usual  direct  problem  (or  a 
simplification  of  such)  is:  given  the  value  of  p(a,0),  boundary  data  along  a  -  0,  and  the  death 
function  A;  compute  the  value  of  p(a,t). 

In  practice  our  knowledge  of  the  death  process  may  be  incomplete.  Even  in  the  case  when  A 
depends  only  on  a  we  may  not  be  able  to  obtain  accurate  data  on  the  age  of  death  or  failure.  The 
case  A  =  A(a)  is  the  age  structured  equivalent  of  the  familiar  Malthus  law,  and  to  obtain  a  more 
accurate  picture  some  dependence  of  A  on  either  p  or  P  must  be  assumed.  How  can  such  a  law  be 
determined,  and  what  are  the  reasonable  experiments  that  would  allow  the  recovery  of  the  death 
function?  In  the  language  of  inverse  problems,  what  overposed  data  can  be  prescribed  that  allows 
the  determination  of  the  coefficient  A? 

In  the  second  section  of  this  paper  we  shall  show  that  one  can  recover  the  value  of  A  =  A(a) 
from  a  measurement  of  the  age  structure  p(a,T),  at  a  later  time  T  and  the  specification  of  p(0,t). 
For  the  case  of  the  boundary  condition  p(0,t)  =  f  0(a)p(a,t)da ,  we  show  that  the  inverse  problem 
can  be  reformulated  as  a  nonlinear  integral  equation  for  the  death  function,  and  indicate  how  the 
equation  may  be  used  to  recover  this  function. 

In  the  third  section  we  consider  the  problem  of  determining  the  dependence  of  the  death 
process  on  total  population.  We  will  not  attempt  to  recover  an  unknown  function  A  of  the  two 
independent  variables  a  and  P.  As  a  minimum  this  would  require  giving  overposed  data  that 
corresponds  to  a  function  of  two  variables,  and  this  is  more  than  we  envision  as  being  reasonable. 
Instead  we  shall  assume  that  A(a,  P)  has  a  known  form  and  that  the  dependence  on  age  has  already 
been  determined.  Two  possibilities  give  reasonable  physical  models:  In  the  first  we  assume  that 
the  death  rate  is  a  simple  sum  of  the  natural  and  environmental  rates,  that  is,  A  has  the  form 
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A yv(a)  +  A e(P),  where  X^  is  the  death  rate  due  to  natural  causes  and  A^  is  the  death  rate  due 
to  the  environment.  In  the  second  case  we  assume  that  A (a,P)  =  A/y(a)[l  +  Ag(.P)].  The  first 
form  assumes  that  the  environmental  pressure  is  the  same  for  all  ages,  the  second  assumes  that  it  is 
coupled  to  age  so  that  environmental  pressure  is  greater  on  the  age  group  with  a  high  natural  death 
rate.  We  expect  that  A#  is  small  for  relatively  small  values  of  P,  so  that  A  «  A ^(a)  and  that  the 
process  is  well  understood  in  this  case.  The  problem  is  to  extrapolate  this  knowledge  to  attempt 
to  recover  the  dependence  of  A  on  P  when  the  latter  may  strongly  affect  the  death  process. 

Here  the  appropriate  overposed  data  could  be  the  value  of  P(t)  itself  since  frequently  this  is  a 
quantity  that  is  easy  to  measure.  We  shall  show  that  one  can  in  fact  recover  A e  from  knowledge  of 
initial- boundary  data  and  P(t)  over  some  range.  For  each  of  these  recovery  problems  we  consider 
the  two  types  of  boundary  condition,  B(t)  =  p(0,t)  given  and  p(0,t)  =  J  0(a)p(a,t)da.  As 
expected,  the  second  of  these  leads  to  a  more  complex  recovery  procedure. 

The  direct  problem  involving  (1.1)  can  be  considered  to  be  well  understood,  even  when  A  and 
0  depend  on  p ,  and  possibly  in  a  non-local  way.  On  the  other  hand,  comparatively  little  work  has 
been  done  even  for  the  most  obvious  inverse  problems.  This  is  not  to  say  that  the  importance  has 
of  identifying  the  birth  and  death  processes  has  not  been  acknowledged  in  the  literature.  This  is  far 
from  the  case,  but  most  of  the  model  testing  has  been  confined  to  the  identification  of  parameters 
that  appear  in  assumed  forms  rather  than  the  recovery  of  the  actual  functional  relationship.  In 
previous  work  the  authors  have  investigated  the  problems  of  recovering  the  initial  distribution 
p(a,  0)  from  certain  overposed  data,  [6]  and  the  determination  of  the  birth  function  0  from  a 
measurement  of  total  population,  [7]. 

We  introduce  our  notation  and  some  of  the  ideas  used  in  the  solution  of  the  direct  problem  for 
(1.1).  We  shall  take  as  our  vocabulary  the  language  of  population  dynamics. 

By  C[a,6]  we  mean  the  usual  space  of  functions  continuous  on  [a,  6]  and  the  norm  on  this 
space  we  denote  by  || . . .  ||  jo  ^  .  Several  subsets  will  be  used.  Cm  is  the  subset  consisting  of  those 
functions  whose  range  lies  in  the  interval  [0,Af].  C+  [a,  6]  is  the  subset  of  non-negative  valued 
functions  on  [a,  6],  and  Co[a,  6]  is  the  subspace  of  functions  that  vanish  at  the  left  endpoint  a. 

We  shall  drop  the  interval  indication  when  no  confusion  can  arise.  We  assume  that  the  species 
has  a  maximum  life  span  of  L ,  although  this  quantity  may  be  infinite.  In  order  to  ensure  that  death 
must  occur  by  age  L,  the  (strictly  positive  valued)  death  function  must  satisfy  fQL  X(a,-)da  =  oo. 
Consequently  A  will  be  singular  if  L  is  finite.  In  the  case  of  a  linear  model  where  the  death 
function  depends  only  on  the  age  a,  it  is  useful  to  use  the  life  table  function  ?r(a)  defined  by 
n (a)  =  exp(—  /0“  A(s)ds).  This  is  a  strictly  decreasing  function  of  a  with  tt(0)  =  1  and  k{L)  —  0. 
It  is  the  probability  that  an  individual  will  live  to  age  a. 


2 


2.  Recovery  of  an  age-dependent  death  rate. 

We  seek  to  recover  the  pair  of  functions  (A,  p)  from 


Pt  +  Pa  +  A  (a)p  =  0 

(2.1) 

with  the  initial  condition 

p(a,  0)  =  <p{a)  , 

(2.2) 

one  of  the  boundary  conditions 

O 

r*. 

II 

*■ 

(2.3a) 

p(0,t)=  I  (3(a)p(a,t)da 

Jo 

(2.3  b) 

In  order  to  achieve  this  we  must  give  some  additional  information,  which  we  refer  to  as  overposed 
data,  since  if  A  were  known,  the  prescription  of  other  information  would  lead  to  an  overdetermined 
problem.  We  shall  use  as  overposed  data 


V’(a)  =  p(a,T) 


(2.4) 


which  corresponds  to  the  value  of  p{a,t)  at  a  later  time  T.  Thus,  we  are  given  the  conservation 
equation  (2.1),  one  of  the  renewal  laws  (2.3),  and  the  results  of  two  censuses  taken  a  time  period 
T  apart. 

Instead  of  the  unknown  function  A ,  we  use  the  life  table  x .  Once  x  has  been  determined  we 
can  recover  A  as  the  logarithmic  derivative  of  this  function. 

Of  the  two  boundary  conditions,  (2.3a)  leads  to  a  considerably  easier  recovery  problem,  and 
we  shall  consider  this  case  first. 

If  we  integrate  equation  (2.1)  with  A  =  A(o)  along  characteristic  coordinates,  we  obtain 


p(a,t) 


p(0,t-a)e  Jo  if  a  <  t; 

<p(a  —  t)e  J—tX(a^da  if  t  <  a. 


(2.5) 


Note  that  if  (2.3a)  is  used,  that  is  the  Dirichlet  data  p(0,  t)  =  B(t)  is  prescribed,  then  (2.5)  with 
t  =  T  can  be  solved  directly  to  recover  A(a) .  If  T  <  L  then  we  have  the  explicit  solution 


A(«) 


d_ 

da 


fl(T-a) 

V'(a) 


M«  -  T)  +  fe  [log 


if  a  <  T; 
if  a  >  T; 


(2.6) 


and  if  T  >  L  then  only  the  first  of  these  equations  is  required.  It  is  therefore  clear  that  if  a  physically 
meaningful  solution  A(a)  is  to  obtained,  then  the  above  equations  constrain  the  admissible  data, 
B ,  ip.  In  fact  in  order  to  obtain  a  continuous  solution  A(a)  to  (2.6)  it  is  necessary  that  <j>,  ip  and 
B(t)  be  positive  and  continuously  differentiable  on  [0,T)  with  ip  having  these  properties  on  [0,  L\. 
The  nonintegrability  condition  Xda  =  oo  will  in  fact  follow  from  ip{L)  =  0.  The  positivity  of 
A  depends  on  the  interaction  between  the  overposed  data  ip  and  the  data  for  the  direct  problem 
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<p  and  B.  Even  in  the  relatively  straightforward  case  of  (2.3a)  there  are  enormous  compatibility 
constraints,  and  these  will  be  greater  when  boundary  condition  (2.3b)  is  used. 

The  second  equation  in  (2.6)  shows  that  if  T  *  L  and  if  A  (a)  has  been  recovered  for  a  <  T , 
then  we  can  obtain  the  values  of  A(a)  over  the  remaining  interval  (T,  L)  in  terms  of  the  initial  and 
final  data.  This  is  true  regardless  of  which  of  the  boundary  conditions  (2.3a)  or  (2.3b)  are  used. 

The  second  case,  where  the  renewal  law  uses  the  specification  of  the  function  /?,  is  more 
complex.  If  we  integrate  (2.5)  in  a  from  a  =  0  to  a  =  L,  then  we  obtain  the  Sharp-Lotka  Equation 

B(t)  =  [  B(t  -  a)/3(a)x(a)  da  +  f  /3(a)x(a)^ — 77  da 

Jo  Jt  T(a  ” l) 

where  x(a)  =  exp(—  J0°  A(s)ds).  The  birth  rate  B  is  a  function  of  x  rather  than  prescribed  data, 
and  we  use  the  notation  B  =  B(t;  x) .  Let  us  assume  that  T  <  L  for  the  moment.  For  a  given 
function  x(a)  defined  on  [0,  L]  we  can  define  the  mapping  x  B(  •  ;  x)  where  B  is  the  solution 
of  the  second  kind  Volterra  equation 

B(t)  =  B(t-a)0(a)it(a)da  +  J  /3(a)<j>(a- "y  da  +  jT  /?(<*)<£(«  ~0  da"  (2-7) 

Here  S  is  that  operator  that  maps  the  values  of  a  function  with  domain  [0,  L\  to  one  defined  on 
[0,T]  by 

(*■(<*)  if  0  <  a  <  T; 

<r{x(a)}  if  T  <  a  <  2T; 

5x(a)  =  |  if  2  T<a<  3  T;  (2-8a) 


and  o  is  defined  by 

<r{x(a)}  =  ~ T)  *  ( 2-8b ) 

and  uses  the  second  line  of  equation  (2.5)  to  map  values  of  x  on  [T,  2T)  onto  values  on  [0,  T] . 
Using  this  we  can  now  write  the  first  line  of  equation  (2.5)  as 


w(a)  =  W^T)  s  TW 


We  can  view  (2.9)  as  a  nonlinear  equation  for  the  restriction  of  the  function  x(a)  defined  on  the 
interval  [0,  T).  If  we  have  obtained  a  solution  of  this  equation,  then  we  can  extend  it  to  successive 
intervals  [T, 2T],  [2T,3T],  ...,  using  (2.8).  It  follows  directly  that  the  function  x(a)  must  satisfy 
the  endpoint  conditions  x(0)  =  1  and  x(T)  =  il>(T)/<f>( 0).  Can  we  show  that  the  mapping  T  has 
a  (unique)  fixed  point,  and  if  so,  can  this  fixed  point  be  obtained  by  the  usual  iteration  scheme 


xn+1  =  T[x„]? 


(2.10) 


An  alternative  formulation  that  eliminates  the  function  B  can  be  obtained  by  using  the  first  line 
of  (2.5)  to  get  B(t)  =  i i>(T  -  t)/x(T  -  t)  and  then  from  (2.7)  we  have,  writing  p{t)  =  l/x(f) 


ip(t)p(t)=  f  ip(T  -  t  -I-  a)0{a)p(<T  ~  Q ^  da  +  [  /3(a)d>(a)P^a  -da 

Jo  PW  ‘  Jt  P\a) 

+  IT 
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We  can  consider  this  as  a  nonlinear,  second  kind  integral  equation  for  the  unknown  p(t).  It  is  to 
be  noted  that  one  can  show  that  both  sides  of  the  abobe  equation  or  monotone  functions  of  A ,  but 
not  necessarily  monotone  in  the  variable  p. 

The  difficulty  in  any  analysis  of  the  mapping  T  is  in  the  relationships  between  the  overposed 
data  xp  and  the  functions  j3  and  <p.  Even  in  the  case  of  given  B(t)  this  was  a  complex  relationship. 
It  can  be  shown  that  |5(a,X!)  -  B(a,  ir2)|  <  Cki(a)  -  ff2(a)|  for  some  C  that  depends  on  the 
data  <p,  (3  and  xp.  This  leads  to  the  inequality  |T[xi]  -  T[t2]|  <  —  *"2(a)l»  and  if  we 

can  impose  conditions  on  the  data  that  guaranteed  C  <  1  then  we  could  invoke  the  contraction 
mapping  theorem  to  answer  our  questions.  We  were  not  able  to  find  any  reasonable  conditions  on 
the  data  that  lead  to  this  condition.  We  can  also  show  that  T  is  a  compact  map  on  the  space 
of  functions  that  are  continuous  on  [0,  T]  and  satisfy  the  bounds  xp(T)/<p(0)  <  it  <  1.  If  it  could 
be  shown  that  T  mapped  this  set  to  itself  we  would  obtain  an  existence  result  using  Schauder’s 
theorem.  Once  again  we  are  prevented  from  carrying  out  the  idea  by  the  interrelationship  of  the 
data. 

However,  we  have  attempted  several  numerical  implementations  of  the  algorithm  defined  by 
the  iteration  scheme  (2.10)  and  convergence  to  the  actual  solution  was  rapid,  if  the  starting  value 
was  taken  as  the  straight  line  joining  the  known  endpoint  values  at  a  =  0  and  a  =  T,  the  relative 
error  decreased  to  within  a  few  percent  in  3  or  4  iterations. 

3.  Recovery  of  a  Nonlinear  death  rate. 

We  consider  the  problem  of  recovering  a  population  dependent  death  function  A(a,  P) .  As  men¬ 
tioned  in  the  introduction,  we  shall  look  for  a  A  in  one  of  two  forms;  either  A (a,P)  =  Ajv(a){l  + 
f(P(t))}  or  A(a,P)  =  A N(a)  +  f(P(t)),  where  A N(a)  is  a  known  function.  Thus  we  seek  to 
determine  the  dependence  of  A  on  the  total  population  under  the  assumption  that  we  know  the 
age  dependence  of  this  function.  In  many  physical  models  this  is  realistic  since,  if  the  total  pop¬ 
ulation  is  much  smaller  than  the  carrying  capacity  of  the  environment,  it  is  to  be  expected  that 
overcrowding  would  play  a  negligible  role  in  the  death  process,  that  is  A  w  A/v(a).  The  values  of 
such  a  A/v  may  be  known,  or  could  have  been  obtained  from  the  methods  of  the  previous  section. 
From  physical  principles  we  expect  f(P)  to  be  a  nonnegative  function,  monotonically  increasing 
in  P  with  /( 0)  =  0.  In  our  mathematical  model  we  shall  require  /  to  satisfy  only  the  first  of 
these  conditions.  It  will  turn  out  that  both  of  the  above  forms  for  A  lead  to  similar  mathematical 
problems,  so  that  we  shall  present  the  details  for  one  and  only  briefly  point  out  the  differences 
presented  by  the  other. 

We  thus  seek  to  recover  the  pair  of  functions  (/,  p)  from 


Pt  +  Pa  +  A(o,  f)p  =  0 

(3.1) 

with  the  initial  condition 

p{a,0)  =  <p(a ) 

(3.2) 

and  either  of  the  boundary  conditions 

p(0,t)=  B(t) 

(3.3a) 

or 
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(3.36) 


i: 


(3(a)p(a,t)da 


=  0 


and  the  overposed  data 

P(t)  =  I  p{a,t)da  (3.4) 

Jo 

which  are  prescribed  over  some  interval  0  <  t  <  T . 

Since  the  unknown  /  only  appears  in  the  form  f(P{t)) ,  we  shall  make  the  change  of  variable 
f(t)  =  f(P(t)) .  Thus  /  is  uniquely  recoverable  from  /  provided  that  P  is  monotonic,  a  condition 
we  shall  require.  In  order  to  indicate  the  dependence  of  p  on  /  we  shall  write  the  solution  of  the 
direct  problem  (3.1)  -  (3.3)  by  p(a,t;  /). 

For  a  solution  pair  (p,  f)  we  require  that  /  6  Cm  for  some  M  >  0 ,  that  p(a,  t )  have  continuous 
partial  derivatives  on  (0,T)  x  (0,T)  except  on  the  line  a  =  t  and  that  (3.1),  (3.2),  (3.4)  and  one 
of  (3.3)  holds. 

We  shall  deal  with  the  case  A(a,P)  =  A/v(a){l  +  f(P(t)) }  for  the  moment  and  make  the  following 
assumptions  on  the  prescribed  data: 

Al.  A iv(a)  is  a  continuous,  positive  function  on  [0,  L)  with  /0L  A s{a)da  =  oo .  (3(a)  is  continuous 
and  non-negative  on  [0,  L ] . 

A2.  <f>(a)  is  a  continuous,  non-negative  function  with  <f>(a)  exp(/Qa  A N(s)ds)  uniformly  bounded 
on  [0,1]. 

The  condition  on  Aw  ensures  that  all  members  of  the  population  must  die  by  age  L .  The  condition 
on  4>  states  that  the  initial  population  be  consistent  with  the  death  process  A  =  A^. 

We  shall  denote  by  D(t;  f)  the  function  \^(a)p(a,t;  f)da.  Thus  D(t\  0)  corresponds  to 
the  case  when  the  death  process  is  independent  of  the  population,  and  in  the  case  of  this  linear 
model  we  can  interpret  D(t]0)  as  the  death  rate  at  time  t. 

We  consider  the  case  of  the  boundary  condition  (3.3a)  first.  In  this  case  we  assume  that  the 
prescribed  birth  function  and  the  overposed  data  P(t)  satisfy 
A3.  B(t)  is  a  continuous,  positive  function  on  [0,r]. 

A4a  P(t)  is  continuously  differentiable,  positive  and  strictly  increasing  function  on  [0,  T]  such 
that  for  some  M  >  0,  B(t)  -  (M  +  l)Z?(t;0)  <  P'{t)  <  B(t)  -  D(t;M)  for  0  <  t  <  T. 

We  shall  also  assume  that  the  birth  rate  at  t  =  0  shall  be  compatible  with  the  initial  data, 
B(t)  ss  Jq  f3(a)<t>(a)da  —  <£(0).  This  condition  will  hold  in  the  case  of  an  already  evolving  process, 
and  is  made  to  simplify  the  analysis.  At  the  end  of  this  section  we  indicate  how  this  (technical) 
assumption  may  be  removed. 

We  show  that  the  solution  to  the  inverse  problem  can  be  reduced  to  showing  that  a  certain 
mapping  of  the  function  /  has  a  fixed  point.  We  do  this  by  showing  that  the  map  is  contractive. 
This  also  leads  to  a  natural  iteration  scheme  that  may  be  used  to  compute  the  function  /.  In 
setting  up  this  structure  we  will'also  show  that  the  map  is  a  compact,  isotone  map  on  Cm  where 
M  is  as  in  A4a.  This  not  only  leads  to  an  alternative  existence  proof  but  shows  that  the  sequence 
of  iterates  converges  monotonically  to  the  solution  of  the  inverse  problem. 
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Differentiating  (3.4),  we  have 


P'(t)=  [  pt(a,t;f)da 
Jo 

=  /  -pa(a,t;f)-\p(a,t;f)da 
Jo 

=  p(0,t;f)~  j  Ma){l  +  f(t)}p(a,t-J)da 
Jo 

=  />(0,t;/)-{l +  /(*)}  /  A N(a)p(a,t\f)da 

Jo 


where  p(L,t\f)  =  0  by  assumption.  Hence 


}{t)  =  - i  =  t  [/]. 

Jo  Ay st(a)p(a,t;  f)da 


In  the  case  of  boundary  condition  (3.3a)  this  gives 


T  a[f)  = 


/0i^(aV(o,t;/)<ia 


and  for  condition  (3.3b) 

■J)±zm.l  (3.7) 

Jo  Ayv(o)p(a,  f;  f)da 

In  eithei  case,  this  suggests  that  we  look  for  a  fixed  point  of  the  mapping  T  =  T„  or  Tj ,  and  seek 
a  solution  of  the  inverse  problem  by  the  iteration  scheme 

/(n+1)  =  T[/*n)]  (3.8) 

for  sonr  suitably  chosen  initial  approximation  /o  •  It  is  straightforward  to  show  that  /  will  be 
solution  of  (3.1)  -  (3.4)  if  and  only  if  /  is  a  fixed  point  of  the  map  T  on  a  suitably  defined  subspace 
of  C[0,2  ]. 

In  the  case  of  condition  (3.3a)  we  can  in  fact  prove  the  following  theorem 

Theorem  1.  If  assumptions  Al  -  A4a  hold,  then  there  exists  a  unique  solution  f  6  Cm  to  (3.1), 
(3.2),  (3.3a)  and  (3.4).  This  solution  can  be  obtained  as  the  limit  of  a  monotonicaly  increasing 
sequence  {/n}£L o  where  /„  is  defined  by  (3.8)  with  T  =  Ta  and  f0  =  0. 

Proof.  The  solution  to  the  direct  problem  (3.1),  (3.2)  and  (3.3a)  can  be  written  in  the  form 


p(a,t-J)  = 


B(t  -  a)e~  fo  A*<»)li+/(‘-a+*>]<1'  if  o  <  t; 
4>(a  -  t)e~  Jo  if*  <  a. 


Thus  for  all  (a,  t)  €  [0,  L]  x  [0,  T\  and  for  /  >  0 


p(a,t;f)  <  max{||5||(0T],||<A||!0iq}. 
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Also  if  fi(t)  >  f2(t)  for  all  t  €  [0,T]  then  it  easily  follows  from  (3.9)  that  p(a,t;  f\)  <  p(a,t;  /2)  for 
all  (a,  t)  €  [0,  L\  x  [0,  T].  The  positivity  of  Xs  now  shows  that  D(t;  f2)  <  D(t;  f2)  for  0  <  t  <  T, 
and  hence  from  (3.6)  Ta[/i]  >  T0[/2].  Thus  Ta  is  an  isotone  map  on  C+[0,T]  and  Ta[/]  is 
continuous  on  [0,  T]  for  any  /  in  Cm  .  Also 


D(t,  0) 


-  1  <  Ta[/]< 


B(t)  -  P'(t) 
D(t;  M) 


Assumption  A4a  now  shows  that  0  <  Ta  [/]  <  M  and  thus  T„  :  Cm  -*■  Cm  • 

We  shall  use  the  notation,  A(a)  =  /0“  XN(s)ds,  x (a)  =  exp(-  /0°  A s(s)ds)  and  <£(a)  =  <p{a)/x(a). 
By  assumption  A2  this  last  quantity  is  uniformly  bounded  on  [0,  L] . 

Let  /i ,  /2  be  in  Cm  and  restrict  t  for  the  moment  so  that  t  <  L.  We  have  that 


D(t;  h)  -  D(t ;  /2)  =  f  A N(a)B(t  -  a)T(a)[e“  £  _  g-  f‘  xN(')Mt-a+,)d^ 

Jo 

fr  \  ,  ,7/  /  sr  -/*  Aw(*)/i(<-a+»)i<»  -  \s(*)f2(t-a+»)d*s, 

+  J  Xs(a)<p(a  -  t)x(a)[e  J—i  -e  Jda. 


Using  the  inequality  e~x  —  e~y  <  x  —  y  gives 

|T>(*;/i)  -  D(t;A)  I  <  ||5||[or]  J*  \„(a)x(a)A(a)da.  ||/,  -  /2  ||(0iT) 

+  J  AN(a)<£(a  -  t)ir(a)[A(a)  -  A(a  -  t)]  da  •  ||/i  -  /2||(0>;r] 


Now 


[  Xisr(a)ir(a)A(a)da  =  -  f  x'(a)A(a)da 
Jo  Jo 

=  1  -  7r(t)  -  A(t)e~A(t) 


and 
rL 


J*  A  N{a)4>{a  -  t)jr(a)(A(a)  -  A(a  -  t)]  da  <  ||<£|||0  Lj  J*  XN(a)x{a)[A(a)  -  A  (a  -  *)]  da 

=  H^l[o,i]  { 7r(*)A(*)  +  [Ma)  -  XN(a  -  t)]ir(a)  da} 

where  we  have  used  tt(0)  =  1,  x(L)  =  0  and  A(0)  =  0  and  the  fact  that  A  is  an  increasing 
function.  Let 

C,(t,AN)  =  1  -  x(t)  -  A(0e_A(° 

C2(t,  An)  =  c*A(‘)  A(t)  +  j  [AN(a)  -  A^(a  -  t)]e-A(o)  da 
It  follows  that  0  <  Ci  <  1  for  t  €  [0,  L\  and  limt_o  C\  =  0,  while  C2  satisfies 


C2(t,X„)<l  + J  X N(a)e~«a)da  + 
<  1  +  ?r(<)  -  x(L  -  t) 


X  v(a  -  t ) 


e-\{a-t)da 
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so  that  Ci  is  uniformly  bounded  on  [0,£].  The  above  estimates  show  that  [A^(a)  —  \s(a  — 
t)]e-^aUa  is  integrable  on  [0,  L\  and  thus  limt—oC2  =  0. 

We  therefore  have 

\D(t,  A)  -  £>(t;  A)|  <  C(t,  B,  *,  Aw)||  A  -  /2||  [0>T]  (3.10) 


where 

\N)  =  Cjit,  A//)||B||j0  Tj  +  Ci(t,  A^)||^j|[o  i) 

Note  that  C  is  independent  of  A  and  A  ? 's  bounded  on  [0,  L]  and  limt_*o  C  —  0 .  Thus 

lT.[Al-T4All<^^^.P(.;A)-i>(.;A>l 
^  c\\h  ~  All[o,r] 

where  C  =  C(t,  Ajv,  B,<(>,  M,  P)  is  independent  of  A  and  A  and  limt_*oC  =  0.  Thus  for  some 
fixed  r,  depending  only  on  M ,  B,  <j>,  A#  and  P 


sup  |Ta  [A]  -  Ta[A]|<  a\\fi  -  Allfo.n 


(3.11) 


where  a  <  1.  Thus  on  the  space  C[0,r]  Ttt(/]  will  be  a  contraction  mapping,  and  will  have  a 
unique  fixed  point  in  C[0,  r] .  By  the  bootstrap  procedure  we  may  extend  this  solution  to  all  of 
[0,  T ]  by  progressing  in  increments  of  r . 

This  shows  that  the  iteration  scheme  defined  by  (3.8)  will  converge  to  the  fixed  point  of  T„ . 
To  show  monotonicity  of  this  sequence  we  first  observe  that  Ta  is  an  isotone  mapping  on  Cm  and 
that  A4a  implies  Ta  [0]  >  0  and  T0  [M]  <  M .  Now  if  fk  is  any  bounded  sequence  in  Cm  it  is 
easily  seen  that  D(t;fk)  is  an  equicontinuous  family  since  D(t;fk )  <  Z?(t;0).  The  Ascoli-Arzela 
theorem  shows  that  Ta  is  a  compact  map  on  Cm  •  These  facts  provide  a  separate  existence  proof 
and  the  further  property  that  the  sequence  {/„}  defined  by  /n+i  =  Ta[/n],  /o  =  0  converges 
monotonically  to  a  fixed  point  of  T„,  [1].  If  instead,  fo  =  M,  then  /„  is  a  decreasing  sequence 
converging  to  a  fixed  point  of  T0.  Of  course,  from  (3.11)  these  fixed  points  must  coincide.  This 
completes  the  proof  of  the  theorem. 


For  the  case  of  boundary  condition  (3.3b)  our  program  is  the  same,  except  that  T*  as  defined 
by  (3.7)  now  has  the  additional  factor  of  the  unknown  “birth  function”  0(a)p(a,t;  f)da,  which 
also  depends  on  /.  We  shall  still  be  able  to  show  that  Tj,  is  a  contraction  on  a  subset  of  C[0,T'], 
but  we  cannot  expect  that  T*  will  be  an  isotone  map,  since  this  will  depend  on  the  relationship 
between  A y(a)  and  0(a).  Thus,  although  we  will  be  able  to  show  that  T&  has  a  unique  fixed  point 
which  can  be  approximated  by  the  sequence  generated  by  (3.8),  the  sequence  need  not  converge 
monotonically  and  perhaps  not  for  all  /o  in  our  admissible  class  C+[0,T]. 

We  shall  have  to  modify  our  conditions  on  P(t);  there  is  a  further  complication  added  to  this  case 
by  the  fact  that  T^is  not  monotone.  Now  it  is  no  longer  sufficient  to  verify  that  T[0]  >  0  and 
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T{,[M]  <  M  in  order  to  claim  that  T  maps  Cm  into  itself.  Actually  we  will  not  attempt  to  obtain 
an  upper  bound  for  the  solution  /;  we  will  be  content  to  show  that  T&  [/]  >  0  if  /  >  0,  for  then  we 
can  show  that  T(,maps  the  subset  C+[0,T]  into  itself  and  if  Tj,is  a  contraction,  then  the  existence 
of  a  unique  fixed  point  is  guaranteed.  For  this  problem  we  make  the  assumption 
A4b  P(t)  is  continuously  differentiable,  positive  and  strictly  increasing  function  on  [0,T]  such 
that  P'(t)  <  f^(0(a)  —  Ajv(a))p(a,t;  f)da  for  all  functions  /  6  C+[0,T]. 

Assumption  A4b  says  that  the  rate  of  change  of  the  total  population  cannot  exceed  the 
difference  of  the  birth  and  death  rates,  for  any  admissible  functions  /.  Since  this  condition  involves 
the  unknown  /,  it  is  not  particularly  satisfactory  and  will  be  difficult  to  verify  in  practice.  However, 
it  is  not  easy  to  give  an  easily-checked  condition  on  P(t)  that  will  guarantee  that  T(,will  map 
C+  [0,  T]  into  itself.  Although  the  function  p(a,  t;  /)  will  be  a  monotone  function  of  /,  we  cannot 
use  this  information  to  obtain  correct  bounds  on  P'(t)  since  the  function  0(a)  — Xs(a)  must  change 
sign  on  [0,  L).  To  see  this  last  fact,  note  that  in  order  for  P(t )  to  be  increasing  the  net  birth  rate 
must  be  positive.  This  cannot  be  the  case  if  0  <  X^  for  all  a  6  [0,  L).  On  the  other  hand,  0  is 
bounded  and  A  ^  is  unbounded  at  a  =  L .  We  are  thus  left  with  the  “average  condition”  indicated 
in  A4b.  However,  assuming  the  model  is  exact,  the  practitioner  can  be  satisfied  that  any  measured 
data  P(t)  must  have  come  from  a  physically  reasonable,  and  hence  positive  /.  Thus  there  will  be  a 
fixed  point  in  C+[0,  T\  to  which  the  iteration  scheme  will  converge.  The  monotonicity  assumption 
on  P  is  to  enable  the  function  /  to  be  recovered  from  /. 

Given  this  we  can  prove: 

Theorem  2.  If  assumptions  Al,  A2  and  A4b  hold,  then  there  exists  a  unique  solution  f  6 
C+[0,T]  to  (3.1),  (3.2),  (3.3a.)  and  (3.4).  This  solution  can  be  obtained  as  the  limit  of  the  sequence 
{fn}%Lo  where  fn  is  defined  by  (3.8)  with  T  -  T b  and  /o  =  0. 

Proof.  First  observe  that  since  0(a),  0(a)  and  A n(a)  are  positive  it  is  easily  shown  that  p(a,V,  f) 
will  be  non-negative  for  /  >  0.  For  f\  and  /2  in  C+[0,T]  we  let  Aj (a,P)  =  Ajv(a){l  +  /j(.P(0)}, 
i  =  1,2.  If  p(a,t;fi)  and  p(a,f;/2)  are  solutions  to  (3.1),  (3.2)  and  (3.3b)then  p(a,t)  =  p(a,t;f\)~ 
p(a,t ;  /2)  is  a  solution  of 


Pt  +  Pa  +  Ax(a,  P)p  =  F(a,  t) 

(3.12) 

p(a,0)  =  0 

(3.13) 

rL 

&(*;/i»/a)  :=  P(0,t)  =  /  0(a)p(a,t)da . 

Jo 

(3.14) 

where 

F(a,f)  =  AN(a)p(a,t;/1)[/2(t)-  /i(t))- 

Using  integration  along  characteristics  we  can  easily  show  that  p(a,  t)  satisfies 


9  ’  \  b(t  -  a)e~  Jo  Xl(9'P{t-a+'))da  f*  e~  J' X'{v'P{t~ 


if  t  <  a. 


a+r,))dr‘ F(s,t  -  a  +  s)ds  if  a  <  t; 

(3.15) 
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where  6(-)  =  6(-;/i,/2)  is  given  by  (3.14).  Substitution  of  (3.15)  into  (3.14)  leads  to  the  following 
second  kind  Volterra  equation  for  b(t) 

b(t)  =  f  b(t  -  a)e~  So  x'ia’Pit-a+t))d*(3(a)da  +  tf(t)  (3.16) 

JO 

where 

*(*)  =  J  {  J  Xi(a~t+a'P{a))da\N(s)p(a  -  t  +  s,s;/i)[/2(s)  -  /i(s)]ds}da 

+  J  {J  e~  X'(‘3'P('t-a+a))da*N(s)p(*,t  -  a  + s;fi)[f2(t  -  a  + s)  -  fi(t  -  a,  + s)]ds'jda, 

We  can  draw  two  conclusions  from  this.  First  if  fi(t)  <  /2(f)  on  [0,T],  then  ¥(t)  >  0.  This  in 
turn  shows  that  b(t)  >  0  and  thus  from  (3.15)  that  p(a,t;fi )  >  p(a,t;f2),  so  that  p(a,t;f)  is 
again  a  decreasing  function  of  /.  Thus  p(a,t;f)  <  p(a,t;  0)  for  /  6  C+  [0,  T\ .  Second, 


sup  |¥(s)|  <  C(t,\N,<t>,P)  sup  |/2(s)- fi(s)\ 

0<s<t  o <*<< 

where  C(-)  —  0  as  t  — ►  0.  Since  $  is  the  free  term  of  the  Volterra  equation  (3.16),  the  solution 
b(t)  of  this  equation  must  satisfy  the  estimate 


sup  |6(s)|  <  C(t,\]sf,<f>,P,/3)  sup  |/2(s)  -  /i(s)|. 

0<a<t  0 <a<t 

We  can  use  this  estimate  to  show  in  a  manner  similar  to  that  leading  to  (3.10)  that 

-  mh) I  <  CM,*sM\h  -  /2|I[0,ti 

where  C(t,<f>,  Xn,/3)  — *■  0  as  t  -*■  0. 

Now  for  f\  and  /2  in  C+[0,T]  we  have 

D{t\h)[p{g,t-,h)  -  i»(0]  -  P{t-h)[p{g,t-f2  -  P'(t)\ 


(3.17) 


(3.18) 


|T6[/i)  -  X6[/a]|  < 


K  jpjMj h)  +  P'(t)\\D(t,  A)  -  P(t;  /2) |  j-  D(t;  A)|b(f;  fuf2) | 

D2(t;M) 

<  |p(q,  t;  0)  +  P'(QHI>(<;  h)  -  D(t;  f2)\ +  D(t ;  0)|fr(t;  fu  f2)\ 

D2(t\  M) 

Combining  the  above  with  the  estimates  (3.17)  and  (3.18),  we  obtain 


~  T^\  z  cWh  ~  All[o ,t] 

where  C  depends  on  the  known  functions  \s,  0  and  <p  and  C  -*■  0  as  t  ->  0.  Thus  for  some  fixed 
r ,  depending  only  on  the  given  data,  there  is  an  a  <  1  such  that 


sup  |T.[/1]-T.[/2]|<o||/I-/a||I031 

0<t<T  1,1 

As  in  the  previous  case,  we  may  bootstrap  this  local  fixed  point  to  one  obtained  on  the  interval 
[ 0,T ].  This  completes  the  proof  of  the  theorem. 

The  above  analysis  shows  that  we  have  in  fact  proved  the  following 
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Theorem  3.  If  assumptions  A1  and  A2  hold,  with  P'(t)  a  positive,  continuous  function  on  [0,T] 
and  if  there  exists  a  solution  f  £  C+(0,T]  to  (3.1),  (3.2),  (3.3b)  and  (3.4),  then  this  solution  can 
be  obtained  as  the  limit  of  the  sequence  {/n}£L0  where  /„  is  defined  by  (3.8)  with  T  =  T&  for 
fo  sufficiently  close  to  /.  Moreover  there  is  at  most  one  solution  in  C+[0,T] . 


Let  us  now  look  at  the  analysis  when  the  death  function  A  has  the  form  A (a,P)  =  Ayv(a)  + 

/(J*(0). 

In  this  case  the  mapping  T  in  (3.5)  takes  the  form 


-  _  p(0,t;f)-P'(t)-D(fJ) 
L/J  P{t) 


(3.19) 


where  as  before,  D(t;  /)  is  given  by  X^(a)p(a,  t;  f)da ,  and  p(a,  t;  f)  represents  the  solution  to 
the  direct  problem  (3.1)  -  (3.3),  which  can  be  written  in  the  form 


p(t  -  a,  0)e  /o  if  act; 

</,(a  -  t)e~  fo  if  t<a. 


(3.20) 


With  an  analysis  much  like  the  previous  case  we  can  show  that  T[/j  is  a  contraction  on  the 
space  C+[0,T]-,  and  that  for  /j  and  h  in  C+[0,T]  with  f\  >  A,  p(a,t;fi)  <  p(a,t;fi)  so  that 
D(t;  fi)  <  D(t;fi).  For  the  case  of  boundary  condition  (3.3a)  where  p(0,t)  is  prescribed  data 
B(t),  this  shows  that  T  is  an  isotone  mapping  on  C+[0,T]. 

We  assume  that  P{t)  is  a  continuously  differentiable,  positive  and  strictly  increasing  function 
on  [0,  T] .  If,  in  addition,  it  satisfies 

P(0)e~m  +  /  e-M<‘-T)[£(r)  -  Z?(r;  M)\dr  <  P(t )  <  P(0)  +  f[B(r)  -  D(r ■  0)]dr  (3.21) 

Jo  Jo 

for  some  M  >  0,  then  it  is  easily  shown  that  under  condition  (3.3a)  T][0]  >  0  and  T][Af]  <  M , 
with  the  consequence  that  T  maps  Cm [0,  T\  into  itself.  These  results  can  be  summarized  in 

Theorem  4.  Let  X(a,P)  =  A#  +  f(P)-  If  assumptions  A1  -  A3  and  (3.21)  hold,  then  there 
exists  a  unique  solution  f  E  Cm  to  (3.1),  (3.2),  (3.3a)  and  (3.4).  This  solution  can  be  obtained 
as  the  limit  of  a  monotonicaly  increasing  sequence  where  /„  is  defined  by  (3.19)  with 

p( 0,  t)  =  B(t)  and  f0  =  0. 

Theorem  5.  Let  A(a,  P)  =  \N  +  f(P).  If  assumptions  A1  and  A2  hold,  with  P'(t)  a  positive, 
continuous  function  on  [0,  T]  and  if  there  exists  a  solution  f  £  C+[0,  T]  to  (3.1),  (3.2),  (3.3a)  and 
(3.4),  then  this  solution  is  unique  and  can  be  obtained  as  the  limit  of  the  sequence  {fn}%Lo  where 
fn  is  defined  by  (3.19)  with  p(0,t;f)  =  f0L  /3(a)p(a,t;;f)da  for  fo  sufficiently  close  to  /. 

In  the  case  of  Theorem  4  where  p(a,  t;  /)  =  B(t )  is  prescribed,  the  recovery  of  /  is  particularly 
easy.  It  is  straightforward  to  show  that  the  equation  T[/]  =  /  is  equivalent  to  a  linear  Volterra 
integral  equation  of  the  second  kind  for  the  function  exp (/0*  f(s)ds). 
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If  we  do  not  make  the  compatibility  assumption  that  fQL  (3(a)<j>(a)da  =  4>(Q)  then  we  have  to 
modify  our  argument  to  include  the  possibility  that  p(a ,  t)  might  be  discontinuous  along  the  line 
a  =  t.  We  should  now  write  (3.4)  in  the  form 

P(t)  =  J  p(a,t)  da  +  ^  p(a,t)da 

and  when  we  differentiate  this  equation  in  t  we  obtain  an  additional  term  due  to  the  jump  discon¬ 
tinuity.  The  new  version  of  (3.5)  becomes 


j(t)  =  p(0,t-J)-P'(t)  +  [B(0)-m}e 


-U+/(0)  *N(s)ds 


fo  *N(a)p(a,t\ f)da 


-  1  =  T[/] 


(3.22) 


where  B{ 0)  is  data  in  the  case  of  condition  (3.3a)  and  equal  to  f3(a)<j>(a)da  in  the  case  of  (3.3b). 
We  can,  as  before,  show  that  T  is  a  contraction  provided  |5(0)  —  <£(0)|  is  sufficiently  small.  With 
condition  (3.3a)  T  will  still  be  an  increasing  map  if  5(0)  <  <f>(0)  and  Theorem  1  will  go  through 
as  before. 
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1.  Introduction 


Consider  a  physical  heat  conduction  process  in  a  dynamical  system  Where  we  assume  that 
the  heat  flux  across  the  boundary  varies  with  the  temperature  but  do  not  know  the  actual 
law  of  heat  exchange  between  the  system  and  its  surrc^ding.  This  situation  could  occur 
when  such  exchange  is  a  combination  of  both  heat  convection  and  radiation.  It  is  therefore 
of  physical  interest  to  be  able  to  recover  such  a  law  of  exchange,  and  this  paper  addresses 
the  question  of  whether  the  law  can  be  uniquely  determined  by  a  measurement  of  the 
temperature  at  a  certain  fixed  point  on  the  boundary.  We  show  that  this  is  the  case,  and 
in  addition  prove  that,  in  a  sense  to  be  defined,  depends  continuously  on  the  values  of  the 
temperature  measurement. 

The  precise  mathematical  statement  of  this  physical  description  is  as  follows: 

Let  T  >  0  and  Qx  =  ft  x  (0,T],  where  ft  is  *  bounded  region  in  Rn  with  a  smooth 
boundary  5  =  5ft.  Find  a  pair  of  functions  u(x,t)  and  p(s)  defined  on  Qx  and  [A,  B], 
respectively,  which  satisfy  the  following  equations: 


ut  -  Au  =  f(x,t), 

in  Qt , 

(1.1) 

du 

OX  +p(u)  =  g(x,t), 

on  Sx  =  5ft  x  [0,T], 

(1.2) 

u(x,0)  =  u0(ar), 

on  0, 

(1.3) 

and  the  additional  condition 

u(x0,t)  =  h(t),  t€[0,r]. 

where  xq  is  a  fixed  point  of  5ft,  N  is  the  inner  normal  to  5ft  and  A  =  min<3Tu(x,t)  and 
B  =  maxQTu(x,t). 

Recently,  considerable  attention  has  been  paid  to  the  recovery  of  one  or  more  coefficients 
for  a  parabolic  initial-boundary  problem  from  over-specified  conditions,  see  [1,  3,  7,  10]  for 
example.  Several  authors  have  considered  inverse  parabolic  problems  in  which  an  unknown 
coefficient  appears  on  the  boundary.  One  model  makes  the  assumption  that  the  heat  flux 
across  the  boundary  is  a  linear  function  of  the  temperature,  that  is  the  relation 

-Ug°x' +  h(t)u( 0,  t)  =  g(t),  t  6  [0,  T], 

holds,  but  where  h(t)  is  unkown.  T.  Suzuki,  [13,  14]  proved  that  one  can  uniquely  determine 
the  function  h(t)  from  spectral  data  using  the  method  of  Gel’fand- Levi  tan.  For  additional 
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work  in  this  direction  see  [9],  [11].  For  the  semilinear  problem  (1.1)  -  (1.3),  Scheglov,  [12], 
used  integral  estimates  to  prove  uniqueness  of  the  solution  in  the  class  of  piecewise  analytic 
functions.  More  recently,  Pilant  and  Rundell,  [10]  studied  the  problem  (1.1)-(1.4)  in  one 
space  dimension  and  established  a  local  existence  and  uniqueness  result  using  the  contract 
mapping  principle. 

Many  of  the  uniqueness  results  obtained  for  inverse  problems  for  semilinear  parabolic  equa¬ 
tions  rely  on  the  monotonicity  of  the  solution  in  the  time  variable.  Not  only  does  this 
restrict  the  class  of  problems  considered,  but  uniqueness  results  are  obtained  only  within 
a  restricted  class  (essentially  analytic  functions)  of  coefficients.  Such  was  the  case  for  the 
recovery  of  unknown,  temperature  dependent  conductivities  or  forcing  functions  from  over- 
specified  boundary  data  in  [4],  [7],  [8].  The  aim  of  this  paper  is  to  show  the  uniqueness  and 
the  continuous  dependence  of  the  classical  solution  for  (1.1)-(1.4)  without  making  aproiri 
assumptions  on  the  monotonicity  of  u.  Our  technique  has  the  additional  advantage  of 
showing  uniqueness  in  what  is  essentially  the  largest  class  of  coefficients  for  which  a  strong 
solution  of  the  direct  problem  exists.  The  proof  is  based  on  singularity  estimates  for  the 
fundamental  solution  of  the  heat  equation  and  an  application  of  the  generalized  Bellman- 
Gronwall  inequality. 

2.  Notation  and  Statement  of  Main  Results 

We  follow  the  notation  of  [5]  for  the  spaces  and  their  norms.  By  a  solution  to  the  problem 
(1.1)-(1.4)  we  mean, 

Definition:  A  pair  of  functions  u(x,  t )  and  p(s)  defined  on  Qt  and  [A,  B\ ,  respectively, 
is  called  a  classical  solution  of  the  problem  (1.1)-(1.4),  if 

(1)  u(x,t)  E  C3,}(Qt)  and  p(s)  E  CX[A,B]\ 

(2)  the  inequality  h( 0)  <  u(x,t)  <  h(t)  for  x  E  S  and  t  E  [0,T]  holds; 

(3)  the  equations  ( l.l)-(  1.4)  are  satisfied  in  the  classical  sense,  where  A  =  min QTu(x,t), 
a».d  B  =  min(j7.u(x,t). 

Remark:  As  was  pointed  out  in  [10],  one  can  not  in  general  expect  that  a  solution  of 
problem  (1.1 )-( 1 .4)  has  the  property  (2),  and  it  is  difficult  to  give  conditions  on  the  data  to 
guarantee  this  inequality  since  u(x,t)  depends  on  the  unknown  function  p(s).  In  addition, 
if  p  >  0,  the  physical  interpretation  is  that  heat  is  being  pumped  into  the  region  through 
the  boundary  and  if  p(u)  is  not  uniformly  Lipschitz  then  the  temperature  may  “blow  up” 
in  finite  time.  In  this  case  we  could  not  expect  to  solve  even  the  direct  problem  for  all 
values  of  T . 

We  assume  the  following  basic  regularity  assumptions  hold  throughout  this  paper: 
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R:  The  functions  g(x,t )  £  C2,2(5t)  and  h(t )  6  C2[0,T].  Moreover,  the  function  h(t) 
is  montonic  increasing  on  [0,  T ] . 

Our  main  result  is: 

Theorem.  Assume  the  condition  R  is  satisfied.  Let  ( u(x,t),p(s ))  and  (u(x,  t),p(s))  be 
two  solutions  of  the  problem  (1.1  )-(l-3)  satisfying  the  overposed  conditions 

u(xo,t)  =  h(t),  0  <  t  <  T  (2.1) 

and 

u(iO)  0  =  h(t)  0  <t<T, 

respectively.  Let  H(s)  and  H(s )  denote  the  inverse  functions  of  h(t)  and  h(t) 

sup  |p(s)  -  p(s)|  <  C  sup  \H(s)  -  H(s) |, 

A<a<B  A<a<B 

where  C  is  a  constant  dependent  upon  u(x,t),u(x,t)  and  the  known  data. 

This  shows  the  continuous  dependence  of  the  function  p(s)  on  the  overposed  data  h(t)  int 
supremum  norm.  From  this  result  we  immediately  obtain  uniqueness  of  the  solution  of  the 
inverse  problem 

Corollary.  Under  the  assumption  R,  the  problem  (1.1)-(1.4)  can  possess  at  most  one 
solution. 


(2.2) 
.  Then 


3.  Proof  of  the  Theorem 

We  will  need  two  results  which  we  state  as  lemmas.  The  proof  of  the  first  is  based  on  an 
iteration  argument  and  can  be  found  in  [6j.  The  verification  of  the  second  lemma,  which 
appears  in  [5],  consists  of  a  straightforward  calculation. 

Lemma  3.1.  Let  f{t)  be  a  nonnegative  non-decreasing  function  on  [0, T1]  and  let  k  £ 

(0,1).  If 

v(t)<m+ J‘ 

then  for  some  constant  C  =  C(k,T), 


y(t)  <  cm. 
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Lemma  3.2.  Let  0  <  a,  6  <  n  —  1,  then 


[  dy.  <{c\x~  si*-1 

Js\x  -  y\u\y-Z\b  1c 


-i-a-6}  if  a  +  b  >  n  —  1; 

if  a  +  6  <  n  -  1. 


Let  (u(x,t),p(s))  and  ( u(x,t),p(s ))  be  two  solutions  of  the  problem  (1.1)-(1.3)  with  the 
overposed  data  (2.1)  and  (2.2),  respectively,  and  let  W(x,t)  =  u(x,t)  -  u(x,t)  for  (x,t)  E 
Qx  ■  Then  W{x,t)  must  satisfy  the  equations: 

Wt  -  A W  =  0,  in  Qt ,  (3.1) 

dW 

— +p(u)-p(u)  =  0,  on  ST  (3.2) 

W(x,0)  =  0,  eCl.  (3.3) 


Proof  of  Theorem:  From  the  conditions  (1.2)  and  (2.1),  we  have 

+  p(h(t))  =  $(x0,  t ),  0  <  t  <  T, 

and  it  follows  by  assumption  R  that 

p{s)  =  g(x  o,  H(s))  -  ,  A<s<5,  (3. 

where  #(3)  =  /i_1(s)  is  the  inverse  function  of  h(t).  Similarly,  we  have 

P(s)  =  p(x0,  H{s))  -  dU^XQ^3^  »  A<s<  B,  (3 

where  /f(s)  =  h~l(s)  is  the  inverse  of  h(t).  Applying  the  mean  value  theorem,  we  find 

p{u(x,  0)  -  p(u(x,  0)  =  [g{x 0,  H(u(x,  t))  -  g(x 0,  H(u(x,  *))] 

du(x0,-ff(u(x,Q))  _  du(x0,H(u(x,t))) 

dN0  dN0 

dW(x0,H(u(x,t))) 

=  /3(x,t)w(x,t) - v  ^  — 

+  7(*,0[ff  (a) -#(*)]  (3 


where 


0(x,t)  =  [«7<(xo,0i(x,f))  - 
l(x,t)  =  gt(x0,9*(x,t))  - 


52u(x0,^(x,0)1 


dtdNp 

d2u(xo,0s{x,t)) 

dtdNo 


H'{e3(xj)),  ( x,t)esT , 


,  (x.t)  e  5t, 
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while  6\ (x,t)  and  9? (x,  t)  are  between  H(u(x,t))  and  H(u(x,t)),  63(1,  t)  is  between 
u(x,t)  and  u(x,t)  and  ^(x,t)  and  9s(x,t)  and  between  ff(u(x,t))  and  H(u(x,t)). 

It  is  clear  by  its  definition  that  (3(x,t)  is  continuous  on  St-  We  now  substitute  p(u)  —  p(u) 
in  equation  (3.2)  by  its  expression  in  (3.6)  and  employ  the  representation  of  the  solution 
for  a  second  kind  boundary  value  problem,  (see  [5]).  This  gives 

W(x,  f )  =  f  [  r(x,  t ;  £,  r)<p(£,  t)  dS^dr,  (x,  t )  €  Qr,  (3.7) 

Jo  Js 

where  the  function  r(x,t;f,  r)  is  the  fundamental  solution  of  the  heat  equation  (3.1),  and 
the  function  v?(x,t)  is  the  solution  of  the  integral  equation 

¥>(x,  0  =  2F(x,  t)  +  j*  J\ [  +  /*(*,  t)T(x ,  t;  6  r)] <?(£,  r)  (3.8) 


for  (x,  t)  6  5x  and  where 

F(x,t)  =  +  7(x,  0  [g(«Kx,  0)  -  5(u(x,t))],  (x,t)  £  5t. 

It  is  well-known,  [5],  that 

\T(X,  t;  S,t)\<  x,  £  €  n,  0  <  r  <  t  <  7\  (3.9) 

where  n  is  an  arbitrary  number  in  the  interval  (0,1).  Moreover,  since  5,  as  well  as  the 
coefficients  of  the  equation  (2.1),  are  smooth,  we  apply  the  estimate  (2.12)  of  Chapter  5  in 
[5]  with  fi  =  1  to  find  that 


dr(x,t;£,r),  _ C _ 

dN(x,t)  1  “  It-i-Mx-fl"-2"’ 


(3.10) 


where  n  is  an  abitrary  value  in  the  interval  (^,  1). 

If  we  restrict  p  in  the  interval  (|,1),  and  use  the  estimates  (3.9)  and  (3.10)  in  (3.8),  we 
obtain 


<  2|F(*,0I  +  2C  J‘  Js  K(dr 


<2|F(x,t)|  +  2C  /' 
Jo 


ll^(  •  > r)lli~(n) 


I  t-T\» 

where  we  have  used  Lemma  3.2  to  obtain  the  final  inequality. 


dr,  (x,t)  €  ST- 


(3.11) 
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Let  F'(t)  =  sup0<{<t 1| F(  •  ,0||  t~(5)  •  Then  using  (3.9),  for  t  e  [0,T] 

IW  • . Ollt-,*,  <  2||F(  • ,  Ollt^s,  +  <?  J'  dr, 

<2F-^  +  cJ‘^1^pldr, 

and  an  application  of  Lemma  3.1  gives 

•M-,0IIl~(5)  <CF*(t),  t  e  [o,r]. 

If  we  now  differentiate  (3.7)  in  the  direction  No  and  evaluate  at  tq  ,  we  obtain 
dW(xo,t) 


dN„ 


From  the  assumptions  on  a  solution,  one  has 

^i(O)  <  u(x,t)  <  h(t),  x  6  5,  t  e  [O.T]. 


and  thus 
Consequently, 

Let  us  define 

New  for  t  G  [0,  T] , 


0  <  H  («(*,  t ))  <  t,  t€  [0,  T],  x  6  dil. 


sup  ^(x.  ff(u(x,r))  s  sup  .aw^xp.r). 
0<r<<  dNo  0<r<t  dNo 


Fmd)=  sup  ||F(  •  ,r) 


0<r<f 

<  sup 

0<r<t 


<  sup 

0<r<t 


1  L°°(S) 


■  dW(xp,H(  •  ,r)) 

1  dNo  Ul~(S) 

+  sup  ||7(  •  ,r){^ (u(  •  ,r))  —  H («( •  ,r))}|| 


L°°{S) 


0<r<t 

Hl»(S)  +  ^0®“P(  '  >r))  ~  #  (s(  •  >r)/iu~(s> 


=  5(t)  +  C  sup  ||  #(«(•,  r))  —  R  (u(  • ,  r))]||^0 

0<r<t 


(5) 


(3.12) 


(3.13) 


(3.14) 
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and  we  can  use  the  estimates  (3.10),  (3.12)  and  (3.14)  to  obtain 


S(t) 


dr(x0,r;$,r)i 

dN0  I 


!*>(£,  r)|dS<dr 


<  sup  f  [  I 
0<r<tJ0  JS 1 

r  f  c  M-»T)ii£,~(5)ia:o-^in-2'‘ 

<  sup  /  /  - - —  •  - — - dSedT 

o<r<tJo  Js  |r-r|M 

frM  '^-(s)  j 

~  o  <  <r  Jo  |r-r|** 

<C  sup  [r{s(T)  +  C  sup  \\H(u(-,0)  -  #(u(  •,£))!! 
0<r<tJo  1  0<£  <r 


L°°(S) 


<<7(5(0  +  sup  ||JT(fl(.,r))-^(fi(.,r))|| 

0<r<t 


L°°(S) 


sup  /  17 

0<r<tJo  lr 


j|r  —  r|~'1  dr 
dr 

~r\» 


= -j - -  •  {5(0+  sup  \\H(u(-,r))- n(u(-,r))\\LOO{S)} 

1  ~  I*  0<r<t  '  ' 

<  C*!‘-“{S(1)+  sup  ||ff(a(  •,.•)) -fl(e(-,r))||  }. 

n  #  '  7 


If  we  restrict  T  by  the  condition  0  <T  <Tp  —  >  we  have 

S(t)<c  sup  \\H  (u(  •  ,r))  —  H  (u(  • ,  r))||  £°o(5) ,  l€[0,ro]. 

0<r<t  y  ’ 


We  can  repeat  the  above  procedure  and  obtain  the  estimate  (3.15)  for  all  To  <  T  provided 
that  the  direct  problem  has  a  solution  over  this  range. 

From  (3.4)  and  (3.5)  one  has 


I P(0  -  p(s) I  =  Hot# (0  -  -5(a)]  - 


dW(x0,H(s)) 

dN0 


<C\H(s)-H(s)\+CS{H(s)), 


(3.16) 


where 


a(«)  =  9t{x0,<Ti(s)) 


d2u(x0  ,<t2(s)) 
dN0dt 


while  <7i(s)  and  <73(5)  are  the  respective  mean  values  of  the  functions  £(xo,0  and 
between  H(s)  and  H(s).  Noting  that  A  <  u(x,t)  <  B ,  we  have  from  (3.15)  that 


S(BU))<C  sup  ||ff(a(-,r))-fl'(a(.,r))||1.( 

0<r<R($) 

<C  sup  |#(s)  -  (,»)|. 

A<a<B 


8 


Finally,  we  apply  the  above  estimate  to  the  inequality  (3.16)  and  obtain 

sup  |p(s)  -  p(s)|  <  C  sup  |#(s)  -  #(s)| 

0<j<B  a<*<b 

to  complete  the  proof  of  the  theorem. 
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1.  Introduction 

We  consider  the  problem  of  recovering  an  unknown  coefficient  (or  coefficients)  from  a 
parabolic  initial  boundary  value  problem  of  the  form 


ut  =  Z,[u]  in  SI  X  [0,r]  (1.1) 

u(x,0)  =  uo(x)  on  fi  (1.2) 

G[u]  =  0  on  dQ  x  [0,  T]  (1.3) 

subject  to  additional  information 

5[u]  =  0.  (1.4) 

The  domain  Q  is  an  open,  simply  connected  region  in  Rn  -  although  in  this  talk,  for 
simplicity,  we  primarily  consider  Q  =  (0, 1),  and  n  =  1.  The  overposed  data  will  usually 
be  in  the  form  of  additional  boundary  measurements.  The  unknown  coefficients  may  be 
present  in  L  or  G  (or  both),  and  we  assume  the  unknown  coefficients  are  functions  of  the 
single  dependent  variable  u.  Cases  where  the  unknown  coefficients  depend  only  on  x  or  t 
have  been  extensively  studied  and  will  only  be  mentioned  briefly.  Reference  [7]  contains  a 
summary  of  many  of  these  results  and,  in  addition,  has  an  extensive  bibliography  of  over 
1000  articles  pertaining  to  the  one-dimensional  heat  equation.  Several  recovery  methods  are 
considered,  and  the  relationship  between  the  methods  is  explored.  Various  issues  arising  in 

•This  research  was  supported  in  part  by  NSF  grant  #DMS8701338  and  ONR  grant  #ONR  N00014- 
89- J 1008. 
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discretization,  numerical  solution,  and  treatment  of  noisy  data  are  also  discussed.  Finally, 
a  class  of  fixed  point  schemes  based  on  residual  updating  are  reviewed. 

The  main  goal  of  this  talk  is  a  brief  introduction  to  some  of  the  methods  used  for 
nonlinear  coefficient  recovery,  and  an  examination  of  their  interrelationships.  A  survey  of 
some  of  the  literature  in  this  subject  is  included  in  the  bibliography. 

The  outline  of  this  talk  is  as  follows: 

1.  Introduction 

2.  Background 

3.  Model  Equations  and  Inverse  Problems 

4.  Meta-Theorem 

5.  Overposed  Data 

6.  Existence,  Uniqueness,  and  Monotonicity  Results 

7.  Numerical  vs.  Analytic  Methods 

8.  Discrete  vs.  Continuous  Methods 

9.  Multiple  Unknown  Coefficients,  Problems  with  High  Dimension 

10.  Further  Questions  and  Open  Problems 

11.  Summary 

2.  Background 

In  order  to  motivate  the  discussion  of  this  particular  class  of  inverse  problems,  we  begin 
with  some  examples  of  models  and  applications  which  have  appeared  in  the  literature. 

Physical  Models:  Classical  examples  of  the  equations  we  consider  are 
Heat  Conduction: 

tit  -  V.(I?Vu)  =  / 


Chemical  Kinetics: 

tit  -  Diu xx  =  /i(u,  v) 
vt  -  D2vxx  =  f2{ tt,v) 


Population  Dynamics: 


dP 

or,  —  - DPxx  =  f(P ) 


where  D  is  a  diffusion  coefficient  and  /  is  a  source  term.  (In  all  the  above  cases,  standard 
initial  conditions  and  boundary  conditions  are  imposed).  The  models  may  be  in  the  form 
of  ordinary  or  partial  differential  equations,  single  equations  or  systems,  and  possess  one  or 
more  spatial  dimensions. 

The  physical  models  are  initially  black  boxes,  in  the  sense  that  the  details  of  the  un¬ 
derlying  process  are  imperfectly  understood: 


Physical 

process 


initial  + 
boundary 
conditions 


interior  + 

boundary 

information. 
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The  physical  models  are  initially  black  boxes ,  in  the  sense  that  the  details  of  the  un¬ 
derlying  process  are  imperfectly  understood: 
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We  may  have  some  a  priori  information  on  the  form  of  the  mathematical  model  by  imposing 
conservation  laws.  For  example,  one  often  applies  a  conservation  of  energy  balance  in  the 
form 


Change  in  _  Energy  Lost  Energy  from 

Total  Energy  Through  Boundary  Internal  Sources 


For  a  model  of  heat  flow,  if  E  denotes  the  specific  energy,  Q  the  heat  flux,  and  7  the  heat 
generated  in  the  interior  of  the  region  ft,  we  have 


d_ 

dt 


IiEdV 


j>  Q-nds+  J  J  7  dV  =  J  J  V.Q  dV  +  J  J  -jdV. 


Given  sufficient  regularity,  this  leads  to 


dE  du 
du  dt 


V.Q  +  7. 


We  denote  by  c  =  the  specific  heat.  If  c,  7,  and  Q  depend  on  the  unknown  u  and, 
in  addition,  Q  depends  monotonely  on  the  gradient  Vu,  we  have  the  quasilinear  parabolic 
equation 

c(x,  t,  u)ut  -  V.Q(x,  t ,  u,  Vu)  =  7(1,  t,  u). 

Assuming  a  linear  relationship  between  Q  and  Vu  leads  to  Q  =  DVu  and  to  the  relation 


c(x,  t,  u)ut  -  V.U(x,  t,  u)Vu  =  7(1,  t,  u). 

How  good  is  this  model?  This  depends  on  the  physics  included,  the  nature  of  the  materials 
involved,  and  a  priori  assumptions  on  the  nature  of  physical  laws. 

The  direct  problem  is  to  prescribe  the  coefficients  {c,  $,7},  impose  appropriate  initial 
and  boundary  conditions  ( primary  data),  and  use  them  to  solve  for  the  dependent  variables. 
The  inverse  problem  is  to  recover  one  or  more  of  the  unknown  coefficients  as  well  as  u  by 
prescribing  additional  data.  If,  for  example,  we  wish  to  deduce  the  relationship  of  D  to  its 
arguments,  we  must  perform  experiments.  We  know  the  input,  and  resulting  output,  and 
want  to  find  D.  This  is  a  classic  example  of  an  inverse  problem. 

In  the  general  case,  we  may  expect  I?  to  be  a  complicated  function  of  x,  t,  u,  and 
Vu.  Under  special  circumstances,  it  may  have  a  simpler  dependence.  If  the  material  is 
very  uniform,  we  do  not  expect  material  coefficients  to  depend  on  the  spatial  variable  x. 
If,  in  addition,  the  material  properties  do  not  change  in  time  when  the  dependent  variables 
are  held  fixed,  then  we  may  expect  that  the  coefficients  do  not  depend  strongly  on  t.  In 
such  cases,  a  functional  dependence  on  only  the  dependent  variables,  and  perhaps  their 
gradients,  is  not  an  unreasonable  assumption. 

The  case  where  the  unknown  coefficient  D  is  a  function  of  t  only,  has  been  investigated 
in  [3,  4,  7,  17,  23].  In  some  sense,  this  was  a  first  approximation  to  the  nonlinear  coefficient 
recovery  case,  in  that  the  unknown  coefficient  may  depend  much  more  strongly  on  t  than 
on  x.  If  u (x,t)  is  slowly  varying  spatially,  then  D(u(x,t))  rsj  D(t). 
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If  the  diffusion  D  is  a  function  of  x,  we  generally  need  interior  measurements,  or  many 
(probably  infinite)  boundary  measurements.  [The  case  of  recovering  the  coefficient  D(x) 
in  V.(D(x)Vu)  =  0  is  discussed  in  the  papers  by  Isakov  and  by  Sylvester  and  Uhlmann  in 
this  proceedings].  If  D  is  a  function  of  u  alone,  one  can  show  in  some  cases  that  a  single 
boundary  measurement  suffices  [35].  The  methods  used  in  each  of  these  problems  are  quite 
different. 

If  we  assume  an  a  priori  dependence  D  =  D(t),  D  =  D(x),  or  D  =  D(u),  we  can  match 
the  model  to  data  (output).  What  if  we  do  not  know  the  nature  of  dependence?  For  a  first 
approximation  we  might,  perhaps,  set 

D(x,  t ,  u)  «  D-i(x)  +  Di(u)  +  D3(t) 


to  correspond  to  the  first  few  terms  in  an  expansion  for  D.  If  one  term  dominates  the  others, 
we  can  recover  the  primary  dependence,  and  possibly  recover  the  other  terms  after  fixing 
the  form  of  the  main  dependence.  However,  are  we  entirely  sure  that  the  model  is  correct? 
In  any  mathematical  model  of  a  real  physical  process,  terms  invariably  are  neglected.  The 
accuracy  of  the  simplified  model  can  only  be  tested  by  experimentation  or  fitting  to  data. 
In  certain  parameter  regions,  the  simplified  model  may  be  adequate,  but  as  the  limits  of 
validity  are  reached,  one  must  return  to  the  underlying  process  and  re-examine  the  model. 

If  large  gradients  are  present,  Q  may  depend  on  Vu  in  a  nonlinear  fashion  and,  there¬ 
fore,  D  will  depend  on  Vti.  An  even  more  complicated  dependence  may  occur  if  the  process 
is  not  in  equilibrium.  As  an  example  of  this,  consider  the  case  where  there  is  rate  dependence 
in  the  flux  law;  that  is,  where  the  flux  Q  does  not  instantaneously  respond  to  the  gradient 
Vu.  Thus,  there  is  a  characteristic  relaxation  time  c.  Assuming  to  a  first  approximation 
that 

edtQ  +  Q  =  D0Vu  Do  =  constant,  c  <<  1 
we  have  the  convolution 


Q  =  D0  l  exp(— (-)(*  -  r))Vu(x,r)dr 
J  -  00  e 

=  k  *  DoV u, 


and 


it<  -  k  *  V(D0Vu)  =  7, 


which  is  an  example  of  a  heat  equation  with  memory.  Determining  which  of  the  many 
types  of  nonlinear  dependencies  is  present  is  precisely  the  modeling  process.  Determining 
the  form  of  the  linear  or  nonlinear  equations  by  actually  matching  the  output  of  the  model 
to  the  observed  data  is  the  ultimate  goal  of  solving  the  inverse  problem. 

Applications:  In  general,  coefficients  reflect  scalings  and  parameters  (e.g.  amplitudes, 
frequencies)  of  physical  importance.  One  tries  to  set  up  experiments  in  which  only  one  (or 
possibly  a  few)  parameters  vary  and  then  seeks  to  recover  the  form  of  the  dependency  of  the 
unknown  coefficients  by  repeated  observation.  Unfortunately  (at  least  from  the  perspective 
of  those  who  must  solve  them)  most  equations  which  describe  physical  laws  are  in  fact 
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nonlinear.  The  forms  of  nonlinearity  may  be  difficult  to  determine.  Consider  the  following 
classical  problem  in  population  dynamics.  A  constant  growth  rate  A  for  a  species  leads  to 
the  growth  law 

du  . 

—  =  Au. 
dt 


This  is  unrealistic  in  that  it  leads  to  purely  exponential  growth  (or  decay).  A  more  reason¬ 
able  model  is  the  logistic  equation 


—  =  A(u  —  au2)  =  (A  —  au)u 
dt 


which  leads  to  bounded  growth.  In  reality,  the  population  may  have  a  more  complicated 
growth  law  of  the  form 

du  tt  \ 

T,  “  /(0)' 

If  we  could  monitor  the  solution,  u,  of  this  ordinary  differential  equation  over  a  time  interval 
t\  <  t  <  ti,  we  can  recover  /  immediately.  For  example,  if  u(t)  =  h(t)  on  the  interval 
h  <  t  <  *2,  then  by  substitution  into  the  differential  equation  we  have  h'(t)  =  f(h(t)). 
This  implies  that  /(£)  =  h'(/i-1(0).  This  observation  underlies  much  of  the  convergence 
results  in  the  parabolic  case  when  the  time  dependence  dominates  spatial  dependence  near 
the  overposed  boundary  [32,  34],  that  is,  when  juxa.(i0,t)|  <<  |ut(a;o,t)|. 

Another  natural  application  is  control  by  means  of  boundary  measurements.  If  the  flux 
on  a  particular  boundary  is  under  our  control,  and  depends  on  the  instantaneous  value  of 
the  dependent  variable  there,  we  then  have  a  relationship  of  the  form 


Given  a  desired  response  at  a  point  xq  on  the  boundary,  u(xo,t)  =  h(t),  we  wish  to  find 
the  unknown  control  /  which  achieves  this  target  state.  We  note  that,  in  practice,  inverse 
problems  and  identifiability  problems  in  control  tend  to  be  very  closely  related  [39]. 


3.  Model  Equations  and  Inverse  Problems 

We  will  discuss  various  inverse  problems  and  methods  for  the  parabolic  equation 

tit  =  L(x ,  t\  u)  on  Q  x  [0,  T ] 

G[u]  =  0  on  dSl  x  [0,  T] 

where  the  operator  L  and/or  the  boundary  operator  G  contains  an  undetermined  coefficient. 
Classical  examples  of  such  problems  axe 

c(u)«t  =  urx  +  7(1,  t)  c  >  c0  >  0  unknown  specific  heat 

««  =  0*(k(u)d*ti)  +  7 (x,t)  unknown  conductivity 

««  -  +  f(u)  +  y(x,t)  unknown  reaction  term 

du 

— (0 ,t)  =  F(u(0,*))  unknown  boundary  condition 

ox 
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These  are  canonical  single  coefficient  inverse  problems.  The  coefficients  are  functions  from 
R1  to  R1 .  In  order  to  recover  the  functional  dependency  we  should  prescribe  overposed  data 
with  the  correct  dimensionality.  The  overposed  data  should  map  a  subset  of  ft  x  [0,  T]  — ♦  R1 . 
These  problems  are  in  contrast  to  other  inverse  problems  where  the  unknown  coefficient 
depends  on  x  or  t  but  not  u : 


ut  =  a(t)Au  +  7(x,t) 
ut  =  V(a(x)Vu)  +  7(z,t) 
ut  =  At*  +  f(x,t) 
ut  =  Au  +  p(x)u. 

If  x  6  fl  C  Rn,  these  are  problems  where  the  unknown  coefficient  is  a  function  of  more  than 
one  variable.  Examples  of  such  problems  are  given  in  [7]. 

4.  A  Meta-Theorem 

Finding  exactly  the  right  space  of  overposed  data  is  important  in  order  to  guarantee  exis¬ 
tence  of  a  unique  solution  to  the  inverse  problem.  If  too  much  data  is  overposed,  there  may 
not  be  a  solution  to  the  inverse  problem.  If  too  little  data  is  overposed,  there  may  not  be 
a  unique  solution  to  the  inverse  problem.  Constraints  on  parameters  lead  to  compatibil¬ 
ity  conditions  between  the  primary  and  overposed  data,  which  can  be  quite  complicated. 
.  This  has  led  to  a  dichotomy  between  existence  results  and  uniqueness  results.  Consider  the 
problem  of  recovering  a(x)  in 

d  ,  .  . dux 
“*  -  = 0 

with  u(x,t)  satisfying  initial  data,  Neumann  boundary  conditions,  and  u(0,t)  =  h(t) 
given  as  overposed  data.  Furthermore,  suppose  the  o(x)  is  smooth  enough  so  that  Lu  = 
£(a(x)f£)  =  At*  has  a  complete  set  of  Lj  eigenfunctions,  satisfying 

L<j>j(x)  =  A  j<t>j(x). 

i-oo 

Writing  u(x,t)  =  ^  Uj(t)<j>j(x),  one  can  solve  for  the  coefficients  Uj(t)  and  easily  obtain 
i=o 


«(*><)  =  «i(0)eA,Vi(*) 

;=o 

The  tij(0)  are  determined  by  the  initial  data.  Evaluating  this  expression  on  x  =  0,  we 
obtain  a  Dirichlet  series  for  h(t).  This  implies  that  any  overposed  data  h(t)  =  u(0 ,t)  for 
reasonable  a  must  be  a  (very  small)  subset  of  the  class  of  analytic  functions.  This  is  a  very 
strong  compatibility  constraint.  Under  certain  conditions,  one  obtains  uniqueness  easily, 
but  existence  only  if  the  overposed  data  is  in  an  extremely  small  subclass  of  functions  [30]. 

One  could  argue  that  the  difficulty  is  that  the  overposed  data  is  in  the  wrong  direction. 
We  are  trying  to  recover  a  function  of  x  by  giving  data  in  the  t  direction.  The  following 
meta-theorem  was  formulated  in  the  late  1960’s  for  parabolic  inverse  problems  [6]. 
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META-THEOREM:  The  overposed  data,  and  the  unknown  function 

should  lie  in  the  same  direction. 

If  the  data  is  in  the  “correct”  direction,  the  problem  will  usually  be  well-posed.  If  data 
is  not  in  the  “correct”  direction,  the  problem  will  be  ill-posed.  For  example,  recovering  0(1) 
by  overposing  data  on  x  =  0  leads  to  an  extremely  ill  behaved  problem,  but  recovering  a(t) 
by  the  same  data  is  a  relatively  well-behaved  problem  [3,  4,  23,  24].  Overposing  final  data 
u(x,T)  leads  to  a  much  better  behaved  problem  for  coefficients  which  depend  on  x,  [38]. 


5.  Overposed  Data 

There  are  many  possible  measurements  one  can  perform  on  an  evolving  process.  In  some 
cases  the  experimenter  can  choose  the  form  of  the  overposed  (i.e.,  measured)  data.  This  is 
often  in  the  form  of  local  (pointwise)  data.  Two  common  examples  are 

i)  Dirichlet  type,  u(xq ,t)  =  h(t)  0  <  t  <  T\ 

ii)  Neumann  type,  du/dx(xo,t)  =  h(t). 

In  some  cases,  one  has  only  nonlocal  observations  -  for  example,  measurements  of 
spectra,  measurements  on  the  total  energy  in  the  system,  periodic  data,  or  time-integrated 
data.  Examples  of  nonlocal  data  relevant  to  parabolic  initial  value  problems  are 


a«(0,  t)  +  /3u(l,  t )  =  h(t), 

f  u(x,t)dx  =  h(t), 
Jo 

[  \u(x,t)\pdx h(t), 
Jo 

I  k(t  -  T)u(xo,r)dr  =  h(t). 
Jo 


For  examples  of  some  of  these,  see  [2,7]  and  the  references  contained  therein. 

Overposed  Dirichlet  Data:  The  simplest  type  of  overposed  data  is  Dirichl''.  data.  We 
will  consider  overposed  Dirichlet  type  at  first,  and  return  to  the  other  types  later. 

As  a  special  case  of  the  inverse  problem  ( l.l)-(  1.4),  we  examine  the  initial  boundary 
value  problem 

ut~uxx  =  f(u)  (5.1) 


with  overposed  data 


ux(0,t)  =  q0(t)  ux(l,t)  =  gi(t) 
u(z,0)  =  u0(x) 


u(0  ,t)  =  h(t). 


(5.2) 

(5.3) 

(5.4) 


Other  coefficient  problems  considered  can  be  found  in  references  given  at  the  end  of  this 
paper  [5,  3  -  13,  31  -  36].  In  general,  u  depends  on  /  in  a  complicated  way,  and  we  must 
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use  the  direct  problem  (5.1)-(5.3)  to  obtain  u(x,  t;  /).  Solving  the  inverse  problem  reduces 
to  solving  the  “residual  equation” 

R[f\  =  u(0,f,f)-h(t)  =  0  (5.5) 

for  /,  given  h.  In  other  words,  the  direct  problem  yields  /  •— ►  h,  and  we  must  invert  this 
to  find  h  i — ►  /.  The  analysis  of  this  mapping,  which  we  call  the  “datar  coefficient  mapping” 
is  fundamental  to  the  understanding  of  both  the  inverse  problem  and  solution  strategies. 

Properties  of  the  Dirichlet  Data-Coefficient  Mapping:  For  the  inverse  problem 
(5.1)-(5.4),  the  basis  of  many  of  our  residual  algorithms  is  the  Green’s  function  representa¬ 
tion 

u(x,t)  =  xl>(x,t)  +  J j  K(x,y,t  -  r)f(u(y,T))dydr  (5.6) 

where  i>  satisfies  (5.1)-(5.3)  with  /  =  0;  that  is,  r/>(x,t)  =  u(x,t;0).  A  necessary  condition 
for  solving  (5.5)  is  that  •  fy  be  nonsingular.  Restricting  (5.6)  to  the  overposed  boundary 
x  =  xo  leads  to 

u(x0,t)  =  t/>(z0,t)  +  J J  K(xo,y,t  -  r)f(u(y,T))dydT.  (5.7) 

Equations  (5.6)  and  (5.7)  form  a  coupled  set  of  first  kind  nonlinear  Volterra  integral  equa¬ 
tions.  They  contain  all  the  information  of  (5.1)-(5.4),  but  are  difficult  to  solve  in  this  form. 
Written  in  terms  of  partial  differential  equations  (and  evaluating  on  x  =  x0  =  0)  we  have 

«t(0,  t )  -  «rr(0,  f;  /)  =  /(u(0, 0)  (5.8) 


or  since  u(0,t)  =  h(t), 

=  h'(t)  -  ^ri(0,  t)  -  J J  Kxx(0,  y,  t  -  r)f(u(y,  r))dydr. 

This  amounts  to  converting  (5.7)  to  an  integral  equation  of  second  kind  by  differentiation. 
One  can  show  by  Gronwall  arguments  that  if  /  is  Lipschitz,  then  /  is  a  solution  of  the 
inverse  problem  if  and  only  if  it  is  a  solution  of  the  above  mapping  [34]. 

Historical  Methods  for  solving  the  “Residual  Equation”:  The  first  two  methods 
for  solving  equations  of  the  form  (5.5)  are  special  cases  of  parameter  identification.  In  these 

methods,  we  assume  that  /  can  be  described  by  a  finite  set  of  parameters  {ai,  c*2, . . . ,  a/v}; 

N 

that  is,  /  =  f(o An  example  of  this  would  be  /(£)  =  £  a«<A«'(0  where  the 

»=i 

4>i{x)  are  known  basis  functions.  The  first  method  we  consider  is 

a)  Weighted  Least  Squares.  Here  the  number  of  measurements  ( M )  may  exceed  the 
dimension  of  the  parameter  space  ( N ),  and  consequently  the  model  may  be  overdeter¬ 
mined.  For  /’ s  of  the  above  form,  we  solve  the  problem 

M  M 

min{  V  }.=  min{  V  Wi\u(x0,ti,  f{aj))  -  /i(t<)|2} 

«=i  i=i 
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where  the  Wi  >  0  axe  weights.  This  presents  a  difficult  optimization  problem,  since  the 
functional  may  have  many  local  minima  and  be  very  fiat  in  the  neighborhood  of  the 
minima.  “Output  least  squares”  methods  sometimes  have  these  types  of  difficulties. 

b)  Collocation  Method.  This  method  fixes  the  basis  functions  and  reduces  the  prob¬ 
lem  to  a  nonlinear  algebraic  system  of  equations. 

We  solve  the  equations  -R[/(a.j  )](*«)  =  u(0,  /(«j))  -  h(t{)  =  0  i  =  1, . . .  ,N.  This 

is  called  a  “collocation  method,”  since  we  set  the  residual  to  zero  at  a  finite  set  of 
collocation  points. 

Residual  Update  Methods:  We  now  discuss  several  mappings  (Newton,  Quasi-Newton, 
Homotopy)  which  are  related  to  one  another.  These  methods  depend  on  the  Gateaux 
derivative  of  u,  with  respect  to  /.  They  rely  on  the  fact  that  the  Gateaux  derivative 
satisfies  a  linear  partial  differential  equation.  To  see  how  the  function  it  depends  on  the 
coefficient  /,  we  form  the  quantity  u(x,t ;  f  +  s<f>). 

The  function  u(x,t;/+  s<f>)  satisfies  the  following  boundary  value  problem: 

ut  -  uxx  =  /(it)  +  s<j>( u)  (5.9) 

ur(o»0  =  ffo(0>  u*(i»  0  =  gi(t) 
u(i,0)  =  tto(x). 

Defining  the  Gateaux  derivative  by  it  =  £u(x,t;  /+  s<£)  J  =  J[f ]■<$>,  it  is  easily  seen  that 

u  satisfies  the  equation 

«t  -  «**  =  /'(«)«  +  4>{ u)  (5.10) 

u*(0,t)  =  0,  ur(l,t)  =  0  (5.11) 

it(x,0)  =  0  (5.12) 

where  it  =  u(z We  assume  sufficient  smoothness  so  that  the  Gateaux  derivative 
is  equivalent  to  the  Frechet  derivative.  Knowledge  of  the  functions  /  and  4>  determine 
the  partial  differential  equation,  (5.8),  and  its  solution,  completely.  In  fact,  we  have  the 
representation  formula 


u(x,t)=  f  f  K(x,y,t-T)(j>(u(y,T))dydT 
Jo  J  n 

where  K  is  the  Green’s  function  for  the  operator  dt  -  dxx  - 

Based  on  the  fact  that  the  residual  R[f]  is  the  primary  source  of  information  about 
model  error,  it  is  reasonable  to  believe  most  effective  iterative  recovery  methods  are  equiv¬ 
alent  to  some  type  of  residual  update  scheme.  For  the  problem  (5.1)-(5.4),  these  schemes 
are  generally  of  the  form 


/(n+1>  =  /(n)  +  JF[i?(...  ;/<”>)] 

=  /n)  +  ^[«(0,  <;/<">)  -MO] 
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where  T  vanishes  only  at  the  origin.  We  have  a  solution  (and  convergence)  if  and  only  if 
R  =  0.  The  iteration  sequence  is  equivalent  to  finding  a  fixed  point  of  the  mapping 

/  =  T[f]  =  [I  +  JroRouo]f 


defining  the  iteration  scheme 

/<n)  , 


u 


(") 


R(n) 


/("+1) 


This  is  the  function  space  analog  of  the  “Matrix  splitting  methods”  used  in  numerical  linear 
algebra.  Convergence  depends  on  the  Frechet  differentiation  of  the  map  T.  If  the  operator 
/  +  ^  •  |^  is  contractive,  we  may  expect  some  type  of  convergence.  T  should  be  chosen, 
in  some  sense,  to  guarantee  the  convergence  of  the  iteration  scheme  (as  a  contraction).  It  is 
precisely  the  choice  of  T  which  distinguishes  the  various  iteration  schemes  from  one  another. 

As  an  example  of  this  general  method,  we  have  the  following  schemes: 
a)  Newton-Raphson.  In  this  method  we  seek  to  solve  the  nonlinear  equation 


by  the  iteration  scheme 


Rif]  =  “(o.  <;  /)  -  MO  =  o 


/"+,>  =  /">  -  (fj) 


’V"’)  ■•Rtf"0]. 


Letting  J  =  ^  we  have 

/»+!  =  /»  ~  J[fn\~l.R[fn]  =  fn  ~  -(«(0,  t;  fn )  -  h(t)). 

Rearranging,  this  becomes 


(5.13) 


AU{fn+i-fn)=h(t)-u(0,f,fn). 

We  seek  to  recover  /n+i  from  this  equation.  Letting  u  =  J[fn]-{fn+x  -  fn )»  one  can 
show  that  u  satisfies 


-  Uxx  =  f'n(un)u  +  /„+i(un)  -  /n(un)  (5.14) 

ur(0,0  =  0,  ur(l,0  =  0  (5.15) 

u(x,0)  =  0  (5.16) 

and  on  the  overposed  boundary  x  =  0, 

u(0,0  =  MO-«(<M;/n).  (5.17) 


We  therefore  have 
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where  K  is  the  Green’s  function  for  the  operator  dt  -  dxx  —  /' ( un)  I.  This  is  a  first-kind 
Volterra  integral  equation,  with  convolution  kernel. 

We  can  also  recover  /n+i  by  evaluating  (5.14)  on  the  overposed  boundary  x  =  0,  using 
the  boundary  condition  (5.17)  to  get 

h'(t)  -  ut(0,  t;  /„)  -  u«(0,  f;  /(n+1) 

=  /;(«„( 0,  t))(h(t)  -  un(0,  t ))  +  /n+i (un(0,  t))  -  f„(un(0,  t)).  (5.18) 

This  is  itself  a  nonlinear  equation  for  f^n+1\  and  must  be  solved  by  iteration. 

b)  Quasi-Newton.  In  this  method  we  replace  J[fn\~l  by  a  fixed  operator  K 

f(n+l)  =f(*)-K.  £[/(">]. 

A  natural  Quasi-Newton  method  may  be  defined  by  replacing  /£(un)  by  fn(h)  in  (5.13). 
This  has  the  effect  of  changing  the  linear  operator  J,  and  modifying  the  update  scheme. 
Another  possible  scheme  would  be  to  keep  J[fn\  fixed  for  several  iterations.  The  success 
of  these  schemes  is  highly  problem  dependent.  For  an  arbitrary  choice  of  K,  one  can  not 
necessarily  reduce  this  to  a  projection  scheme  for  a  local  partial  differential  operator. 

c)  Homotopy.  In  this  method  we  have 


/'+A'>  .  M  +  (*«) 


\f{n))  ■  [h('+A,)  -  h(s)] 


where  parameterizes  the  overposed  data  and  is  the  actual  observed  data. 
This  method  can  be  thought  of  as  a  continuation  scheme.  An  initial  guess  /o  is  selected 
and  the  direct  problem  solved  to  recover  the  corresponding  boundary  values  ho(t)  = 
u(0,f;  /o).  We  let  hj(t)  =  h(t),  the  actual  overposed  data,  and  form  the  parameterized 
values  h3(t)  =  (1  —  s)ho(t)  +  shi(t),  moving  from  the  initial  state  to  the  desired  state 
as  s  moves  from  0  to  1.  We  wish  to  find  the  corresponding  reaction  terms  f3(t), 
that  accomplish  this.  The  function  f\  corresponds  to  the  overposed  data  hi  =  h  and 
therefore  is  the  homotopy  solution  of  our  problem.  At  each  value  of  the  parameter  we 
linearize  the  problem  about  the  function  f,  and  move  in  the  direction  dictated  by  the 
overposed  data.  This  corresponds  to  implementing  the  scheme 


J[fa]-(fa+&a  ~  fa)  —  ^j+A a  ~  h3 

which  can  be  written  symbolically  as 

|j.A/  =  A/> 

and  is  therefore  equivalent  to  Euler’s  method  in  parameter  space  s  G  [0, 1]. 

In  a  similar  manner  to  Newton’s  method,  we  can  obtain  a  partial  differential  equation 
for  the  updates  u.  The  result  is 


«t  -  «xr(0,t;/(,))  =  f'3{u3) ft  +  /,+ a#(«#)  -  /»(«*) 


(5.19) 
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ur(0,t)  =  0,  «*(l,t)  =  0 

(5.20) 

ti(x,0)  =  0 

(5.21) 

and 

«(0,t)  =  ^»+a<(0  —  ^*(0* 

(5.22) 

Using  the  representation 

«(0 ,<;/)=  /  /  ^(0,y,t-r)[/J,+A,(«,(y,T))-/J,(u,)(y,r))]dyrfr 
JO  Jo 

we  again  obtain  a  first  kind  VolterTa  equation  for  /4+a»?  of  convolution  type,  where 
K  is  the  Green’s  function  for  the  operator  dt  -  dxx  -  /'(«,)!.  Using  the  additional 
boundary  data,  we  can  obtain  an  update  scheme  for  /s+a« 

—  ka(i)  —  ttrr(0  ,U/j+a»  = 

f',(ua( 0,  t))(h9+Ut)  -  h.(t))  +  /.+  As  MO,  t ))  -  /,K(0,  t ))  (5.23) 

The  common  theme  for  the  previous  three  methods  is  the  fact  that  each  of  the  update 
schemes  computes  an  update  / by  means  of  the  solution  to  a  linearized  partial  differential 
equation.  By  projecting  this  onto  the  overposed  boundary  one  can  recover  the  update 
function  from  (5.18)  or  (5.23). 

Non-Residual  Schemes:  We  now  approach  the  inverse  problem  (5.1)-(5.4)  by  a  different 
method.  Solving  (5.8)  for  /  in  terms  of  h ,  we  obtain 

f(s)  =  h'(h~\s))  -  uxx(0,  h~\s)).  (5.24) 

If  we  eliminate  /  from  (5.1),  using  (5.24),  we  obtain  the  Trace  Type  Functional  (or  TTF ) 
equation 

ut(x,  t )  -  uxx(x,  t)  =  h'ih-'iuix,  t )))  -  usx(0,  h_1( u(x,  f)))  (5.25) 

In  order  for  this  equation  to  be  well  defined,  the  overposed  data  must  satisfy  the  constraint 
0  <  h~l(u(x,t))  <  t.  The  direct  problem  -  that  is,  (5.25)  along  with  the  primary  data  -  is 
to  be  solved  for  u(x,  t ),  with  /  recovered  by  (5.24). 

The  difficulty  with  this  method  is  the  form  of  the  nonlinearity.  Some  results  have  been 
obtained  for  the  time  dependent  coefficient  case  by  finite  differencing  the  nonlocal  uxx  term 
[13].  Recent  results  for  the  TTF  method  are  contained  in  [12,  14-16].  [For  a  more  thorough 
survey  of  the  TTF  method  and  its  application  to  various  inverse  problems,  see  the  paper 
by  Cannon  and  DuChateau  in  this  Proceedings.] 

If  we  eliminate  u  from  (5.8)  by  using  the  representation  formula 

u(x,t)  =  rj>(x,t)  +  J j  K(x,y,t  -  T)f(u(y,r))dydT 

we  obtain  the  Fixed  Point  Projection  (or  FPP)  method 

f(h(t))  =  h'(t)-il>t(0,t)-j j  Kxx(0,y,t  -  T)f(u(y,r))dydT. 


(5.26) 
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We  remark  that  there  are  some  deep  underlying  connections  between  the  FPP,  TTF, 
and  collocation  methods.  We  actually  solve  the  FPP  equation  by  the  equivalent  iterative 
method  in  which  we  update  /  by 

fin+1)W))  =  *'(0  -  t&OM)  =  m  -  uxx(0 ,*;/<">), 

and  update  u  by  solving  the  nonlinear  partial  differential  equation 

t4n+1)  -  4”+1)  =  /<n+1)(u(n+l)). 

Eliminating  /(n+1)  from  the  above  two  equations,  we  see  this  is  equivalent  to  the  iteration 
scheme 

t4n+1)  -  t4”+1>  =  h'(/rVn+1)(M)))  -  uin](0,h-\uln+l\x,t)))  (5.27) 

which  corresponds  to  a  semi-implicit  numerical  strategy  for  the  TTF  equation  (5.25).  Equa¬ 
tion  (5.27)  is  implicit  in  u  and  explicit  in  uxx ,  and  is  solved  in  time  by  a  Crank-Nicholson 
or  Tri-level  scheme  [31,  32,  34].  A  fully  explicit  solution  of  (5.25)  would  correspond  to  the 
equation 

u\n+1)  -  «Lnx+1)  =  h'(h-\u^(x,t)))  -  uxx(0,h-\u^(x,t)))  (5.28) 

=  /<">(«<">(*,  *))• 

A  fully  implicit  solution  of  the  TTF  equation  would  correspond  to  the  equation 

t4n+1)  -  u£+1>  =  V(fc-,(tt<"+1)(*,<)))-  4^f,)(0,A"1(«(w+1)(*,0))  (5.29) 

which  corresponds  in  turn  to  the  coupled  scheme 

U<”+1)  _  tt<£+l>  =  ^(«+D(u(n+l)) 

and  is  iterated  back  and  forth  at  each  time  step.  Suppose  we  have  solved  the  TTF  equation 
by  this  method  for  0  <  t  <  tj.  Because  of  the  monotonicity  of  the  data  and  the  range 
condition,  it  can  be  seen  that  the  /<n+1)  are  changed  only  on  the  interval  [h(tj),h(tj+ 1)]. 
Having  determined  the  /  over  some  range,  it  is  forever  fixed.  This  may  also  be  interpreted 
as  solving  the  nonlinear  Volterra  integral  equation  (5.26)  for  t  >  tji 

f  Kxx(0,y,tj+i  -  T)f(u(y,T))dydr  (5.30) 

Jo 

-  T)f(u(y,T))dydr 

which  requires  inverting  the  kernel  Kx r(0,  y ,  t )  for  small  t.  The  first  three  terms  are  known, 
since  u(x,t)  €  [/i(to),  h(tj)}  for  0  <  t  <  tj  by  the  range  condition. 

If  we  collocate  on  h'(t)  -  that  is,  we  choose  an  /  satisfying  ut(0,fj,  /)  =  h'(tj)  -  we 
have  ^ 

f(u(tj))  =  h'(tj)  -  ~  f  f  Kxx(0,y,tj  -  T)f(u(y,r))dydr. 

Jo  Jo 


f(h(t))=h'(t)-Mo,t)~  r 

Jo 

-  f  f1  Kxx(c 

Jtj  Jo 
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This  is  equivalent  to  solving  (5.30)  by  a  piecewise  linear  function  /  if  we  also  collocate  on 
h(t)  (setting  u(0,  t;)  =  h(tj))  and  choose  the  collocation  points  to  correspond  to  the  actual 
time  steps.  Collocating  only  on  h'(t)  leads  to  small  differences  between  the  methods,  as 
long  as  a  piecewise  linear  /  =  /  is  used  to  solve  (5.30).  If  there  is  a  monotone  relationship 
between  the  overposed  data  and  the  unknown  coefficient,  collocation  methods  can  be  used 
to  solve  the  TTF  equation.  On  the  other  hand,  implementing  the  FPP  method  corresponds 
to  a  particular  numerical  strategy  for  solving  the  TTF  equation. 

Other  Types  of  Overposed  Data:  In  one  dimension,  it  is  often  possible  to  switch  the 
roles  of  overposed  and  primary  data.  (This  amounts  to  showing  that  the  overposed  data  is 
admissible  as  primary  data).  In  higher  dimensions,  this  strategy  may  not  work.  Consider 
the  inverse  problem 

u(-Au  =  /( u)  infix  [0,T]  (5.31) 

xi  —  h  on  dSl  x  [0,  t]  (5.32) 

with  the  overposed  Neumann  data 


=*,«)•  (5.33) 

The  overposed  data  lie  on  a  curve  of  dimension  one.  Using  the  representation  theorem 
u(x,t)  -  ^(x,t)  +  j J  K(x,y,t-T)f(u(y,T))dydT 
we  immediately  obtain  the  “residual  equation” 


_  /,x  _  du /_  dxP(x0,t)  ,  f  f  dK{x0,y,t 

*■<*>-  ■&**<  =  Tv  +JJ - air 


r) 


/(u(y, r)dydr 


which  is  a  Neumann  data-coefficient  map.  This  is  a  first-kind  nonlinear  integral  equation 
for  f(xi).  It  is  not  clear  how  to  convert  it  to  second  kind  by  any  simple  means.  By  analogy 
to  the  Jacobi  iteration  method,  one  possibility  is  to  subtract  off  the  diagonal  terms  to  obtain 
the  second-kind  equation 

/  k(t  -  r)f(h(t))dT  = 

Jo 

=  ^~ +  J J  T))dydT- 

where  k(t)  =  K(xo,y,t)dy.  In  general  this  leads  to  inversion  of  integral  equations  with 
fractional  order  kernels,  whose  inverses  are  not  local  differential  operators. 

Another  phenomenon  manifests  itself  with  nonlocal  overposed  data.  Consider  (5.1)- 
(5.3)  with  overposed  data  of  the  form 
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Integrating  (5.1)  over  [0, 1],  we  immediately  obtain 

E'(t)  =  gi{t)  -  g0(t)  -  f  f(u{y,t)dy.  (5.34) 

Jo 

This  is  a  first-kind  nonlinear  Fredholm  integral  equation,  with  a  smooth  kernel  (K  =  1). 
This  is  not  a  pleasant  situation;  however,  one  can  easily  show  that  /i  >  fi  =>  «i  >  U2  =>• 
Ei  >  f?2,  that  is,  E  is  a  monotone  function  of  /.  This  will  therefore  yield  a  unique  piecewise 
linear  function  /,  by  the  collocation  method,  satisfying  E(tj)  =  f(u(x,tj)dx. 

In  both  of  these  cases,  it  is  not  immediately  clear  how  to  implement  either  the  TTF  or 
FPP  methods. 

6.  Existence,  Uniqueness,  Regularity 

Monotonicity  Methods:  If  u  is  a  monotone  increasing  function  of  the  unknown  co¬ 
efficient  /,  then  u(xo,t;fi)  >  u(xo,t;/2 )  if  fi  >  fi •  As  a  result,  collocation  methods  are 
readily  available.  If  the  initial  value  /o  is  known  (usually  from  consistency  arguments),  then 
on  the  first  interval  «o  <  u  <  «t  we  have  f(u)  ss  f0  +  Mo(u  -  u0),  where  M  is  the  slope  to 
be  determined  by  matching  u(0,  /)  =  h(t\).  Because  of  the  monotonicity,  one  can  easily 

show  that  there  is  at  most  one  value  of  Mq  which  satisfies  this  equation.  In  order  for  this 
to  work  we  must  have  the  range  condition  u(z,t)  6  {u(0,t)}  =  {h(t)}.  We  may  compute 
the  successive  values  of  Af,  by  this  process,  as  long  as  the  overposed  data  h(t)  is  monotone. 
Monotonicity  methods  have  been  applied  to  inverse  coefficient  problems  in  [18]. 

Uniqueness:  Uniqueness  of  undetermined  analytic  coefficients  has  been  shown  in  a 

wide  variety  of  instances.  These  rely  on  the  property  that  any  analytic  function  with  an 
infinite  set  of  zeros  on  a  finite  interval  has  an  accumulation  point  and  therefore  must  vanish 
identically.  In  one  instance,  this  follows  from  the  result  that  the  difference  between  two 
solutions  of  the  inverse  problem  must  vanish  infinitely  often  on  an  arbitrarily  small  interval 
if  they  agree  at  the  left  endpoint,  [19].  Uniqueness  results  have  also  been  obtained  by 
utilizing  contraction  mapping  arguments  [31-36],  and  monotonicity  [18]. 

Energy  estimates  on  the  difference  of  two  solutions  have  been  obtained  by  [14,  27  —  29]. 
These  lead  to  uniqueness  and  stability  (identifiability)  results  in  various  cases. 

Existence:  Existence  questions  are  still  a  major  source  of  difficulty.  In  fact,  existence 
may  not  be  the  appropriate  question.  Existence  requires  that  the  overposed  data  lie  in  the 
range  of  the  nonlinear  map  /  t— *  u(xo,t]  f)  =  h.  This  consequently  requires  that  the  model 
(and  hence  the  mapping)  be  known  exactly,  and  that  the  data  h(t)  be  known  exactly  for  a 
continuum  of  values  0  <  t  <  T . 

The  fixed  point  methods  that  we  use  (based  on  the  FPP  algorithm)  do  not  yield 
existence  at  the  present  time  because  we  assume  the  existence  of  a  solution  to  the  inverse 
problem,  and  show  that  with  sufficient  smoothness,  it  is  a  regular  attractor.  We  then  use  the 
contraction  properties  of  the  mapping  to  show  uniqueness  [31-36].  Although  under  certain 
assumptions  /  6  Lip\  yields  h'  €  Lzpi,  the  range  of  the  nonlinear  map  may  not  in  fact  be 
onto  a  ball  in  Lip\.  If  it  is  not  onto,  then  we  may  expect  that  further  constraints  on  the 
overposed  data  are  necessary.  Because  of  the  discrete  nature  of  overposed  data,  numerical 
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errors  associated  with  actual  implementations,  and  noise  in  the  data,  determining  whether 
the  overposed  data  actually  arises  from  a  function  in  the  admissible  class  is  less  important 
than  whether  the  overposed  data  can  be  approximated  in  the  limit  (in  some  norm)  by 
a  sequence  of  approximations  (which  are  themselves  admissible).  The  robustness  of  the 
algorithm  (that  is,  the  sensitivity  of  the  algorithm  to  noisy  data)  can  therefore  be  checked 
by  perturbing  the  primary  and  overposed  data. 

7.  Numerical  vs.  Analytic  Methods 

Because  of  the  intrinsic  nonlinearity  of  the  direct  problems,  one  must  get  very  precise 
estimates  on  the  forward  map  /  — ►  u[x,  t;  /],  [20,26].  The  whole  arsenal  of  nonlinear  partial 
differential  equation  methods,  boundary  estimates,  and  singular  integral  equation  theory 
must  be  used.  Because  of  this  complexity,  numerical  simulations  are  very  useful.  Reducing 
problems  to  finite  dimension  is  not  without  its  drawbacks,  but  “real”  data  is  always  discrete 
anyway. 

Inverse  methods  based  on  modifications  of  the  FPP  method  usually  have  little  overhead 
in  update  strategy  with  respect  to  direct  solver.  This  results  from  the  fact  that  each  iteration 
of  the  FPP  method  is  equivalent  to  one  direct  solve,  and  the  FPP  method  is  observed  to 
converge  in  a  very  small  number  of  iterations  (typically  less  than  five).  The  update  consists 
of  differentiating  the  numerical  solution  near  the  boundary,  and  smoothing  the  result  with 
a  smoothing  spline.  This  costs  very  little,  compared  to  the  cost  of  obtaining  the  numerical 
solution  (via  a  direct  solve).  This  implies  that  speed  of  convergence  and  stability  are  more 
important  than  cost  of  update  strategy  (at  least  for  evolution  equations).  To  summarize, 
we  observe  that: 

RULE  OF  THUMB:  Good,  efficient,  accurate  direct  solvers  are  essential 

before  one  tackles  inverse  problems  numerically. 


8.  Discrete  vs.  Continuous  Methods 

Once  one  has  discretized  the  inverse  problem  (with  finite  difference  or  finite  element  meth¬ 
ods),  smoothness  is  no  longer  really  the  issue.  All  functions  in  this  setting  are  piecewise 
linear,  or  piecewise  polynomials  of  low  order.  We  must  approximate  the  “exact”  data  by 
discrete  data,  therefore  introducing  errors  into  the  overposed  data.  Noise  and  model  error 
add  to  the  difficulties.  It  is  important  to  check  the  “robustness”  of  the  inverse  method. 
Numerically,  methods  may  be  very  stable,  in  spite  of  the  lack  of  an  existence  or  uniqueness 
theorem.  In  fact: 

RULE  OF  THUMB:  Good,  stable  numerical  results  usually  indicate  the 
existence  of  a  mathematical  theorem! 

The  role  of  “numerical  experimentation”  has  proven  to  be  a  valuable  tool  for  testing 
hypotheses  and  conjectures.  The  “best”  space  for  the  posing  of  the  discrete  problem  is 
not  always  clear  from  the  problem  itself,  but  it  can  be  crucial  to  the  performance  of  the 
numerical  algorithm.  For  example  if  one  applies  the  collocation  method  to  (5.1)-(5.4),  and 
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uses  only  the  overposed  data  to  collocate,  i.e.  u(0,tj;/)  =  h(U),  very  poor  results  axe 
obtained.  It  is  much  better  to  minimize  ju(0,t<;  /)  -  h(ti) |  +  A|tit(0,tj;/)  -  or  even 

to  collocate  on  h'(t),  that  is  set  ut(0,U\f)  =  h'(U).  This  results  from  the  fact  that  the 
mapping  from  h  -*  f  is  not  bounded  but  h'  — ►  /  is. 

Regularizations:  Often,  one  must  perform  certain  “regularizations”  of  the  algorithms  in 
order  to  maintain  stability.  For  example, 

a)  A  priori  bounds  (e.g.,  on  the  specific  heat,  Cx  >  c  >  Co)  may  have  to  be  imposed  by 
truncating  the  iterates.  If  the  iterations  settle  down  so  that  truncation  is  unnecessary, 
then  truncation  appears  only  as  part  of  the  “startup”  cost  for  the  algorithm  (that  is, 
in  obtaining  a  sufficiently  good  initial  guess). 

b)  The  derivatives  of  the  discrete  data  will  usually  have  to  be  smoothed  and  interpolated 
(e.g.,  by  a  smoothing  spline).  Again,  if  the  iterates  become  smooth  enough  so  that  this 
is  unnecessary  in  the  limit,  it  becomes  part  of  the  startup  cost. 

c)  Nonuniform  weighting  of  the  residuals  may  be  imposed.  Initial  errors  may  be  larger 
than  errors  at  a  later  time.  Requirements  on  smaller  time  steps  or  more  closely  spaced 
overposed  data  are  typical  startup  costs. 

There  are  numerous  ways  to  “regularize”  inverse  problems.  If  the  regularizations  disap¬ 
pear  in  the  limit,  they  are  fairly  harmless.  If  they  persist,  then  one  is  solving  a  “perturbed 
problem.” 

9.  Multiple  Unknowns 

Many  of  the  methods  discussed  in  this  survey  can  be  extended  to  multiple  coefficient  in¬ 
verse  problems.  An  example  of  a  multiple  coefficient  problem  is  the  recovery  of  the  solution 
triple,  ( u,  c,  /),  in  c( u)ut  -  Aa  =  /( u)  from  (additional)  overposed  data.  Another  situa- 
tion  occurs  in  the  equation  ut  -  Att  =  /,  when  the  forcing  function  /  might  be  known  to 
be  a  function  of  both  u  and  Vu.  The  projection  methods  are  not  able  to  handle  the  true 
multivariable  case  directly,  but  some  further  restrictions  on  the  nature  of  the  multivariable 
dependence  allow  the  projection  methods  to  be  applied.  One  method,  which  may  be  fea¬ 
sible  under  a  wide  variety  of  circumstances,  is  to  assume  that  the  forcing  function  can  be 
approximated  by  the  two-term  Taylor  series  /(«,Vu)  as  /^(u)  +  /(u).Vu.  The  problem 
can  then  be  treated  as  an  attempt  to  recover  the  set  of  functions  (u ,  /). 

One  can  visualize  problems  of  the  form 


c(u)uf  -  V.(k(u)Vu)  =  f(u)  -|-  7 
du 

T  =  s(“) 

u(x,0)  =  u0(aO 


with  n  >  1  unknowns.  For  a  single  dependent  variable,  we  have  essentially  one  observation 
point  where  Range  {«}  €  {u(xo,t)}.  In  order  to  distinguish  between  coefficients  we  need 
multiple  runs  in  general  (choose  different  7's  or  uo(x)'s).  There  is  much  more  opportunity 
for  degeneracy,  however.  If  the  data  is  very  flat  ie  |V2u|  «  0  then  c(u)u<  ~  /(u)  and  we  get 
h'(t)  ~  f(h)/c(h)  and  only  (f/c)  is  determined,  unless  7^0.  We  obtain  conditions  on  the 
Jacobian  of  the  matrix  which  are  more  difficult  to  enforce  or  verify. 
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Recovering  {u,  / )  from  the  system  of  equations 

ut  -  Au  =  f(u)  +  f 

is  one  of  the  best  type  of  multiple  unknown  problems,  since  the  matrix  multiplying  the 
unknowns  /  is  diagonal.  Issues  in  multi-coefficient  problems  are  still  poorly  understood, 
although  some  work  has  been  done  [31]. 

10.  Further  Questions  and  Open  Problems 

There  are  many  function-analytic  methods  for  solving  the  nonlinear  equations  of  the  form 

£[«[...;/]]  =  0 

in  various  kinds  of  spaces.  Some  of  these,  as  we  have  shown,  can  be  recast  as  iterated 
solutions  of  various  perturbed  equations  and  therefore  lend  themselves  to  schemes  like  the 
fixed  point  projection  methods.  Others  are  more  direct  attacks  using  the  integral  repre¬ 
sentations  for  the  solution,  or  rely  on  properties  of  the  data-coefficient  mapping  such  as 
monotonicity.  Still  others,  based  on  the  TTF  method,  rely  on  existence  theory  for  abstract 
Cauchy  problems.  Classifying  the  various  methods  in  a  more  unified  framework  is  yet  to  be 
done.  Classifying  ill-posed  problems  as  to  the  degree  of  ill-posedness  has  been  am  ongoing 
research  topic  for  many  years  [see  the  tadk  by  Seidman  in  this  Proceedings]. 

If  the  unknown  coefficient  involves  the  gradients  of  the  dependent  variables,  much  less 
is  known.  The  update  schemes  are  less  stable,  and  more  prone  to  error,  particular  in  the 
discrete  case.  Numerical  differentiation  of  data  is  always  tricky.  When  the  overposed  data 
involves  gradients  in  higher  dimensions,  or  is  nonlocal,  it  is  not  immediately  clear  how  to 
apply  either  the  FPP,  collocation,  or  TTF  method. 

11.  Summary 

We  have  observed  that  for  many  inverse  coefficient  problems  in  parabolic  equations,  recov¬ 
ery  methods  generally  fall  into  two  categories  -  residual  update  and  non-residual  update 
strategies.  The  residual  update  strategies  we  have  examined  are  the  FPP,  Newton,  Quasi- 
Newton,  and  Homotopy  methods.  The  convergence  of  these  methods  relies  on  the  Frdchet 
derivative  of  the  DatarCoefficient  mapping.  The  non  residual  schemes  considered  were  least 
squares,  collocation,  and  the  TTF  method.  These  last  two  were  shown  to  be  closely  re¬ 
lated  to  the  FPP  method  when  actually  implemented  numerically.  The  TTF  method  is 
formulated  in  the  terminology  of  an  abstract  Cauchy  problem,  while  the  FPP  method  is 
formulated  in  terms  of  a  fixed  point  mapping. 

The  question  of  classifying  inverse  problems  as  to  the  degree  of  ill-posedness  is  at 
least  subjectively  possible,  through  the  existence  of  a  Meta-Theorem,  examination  of  the 
regularity  of  the  data-coefficient  mapping,  and  existence  and  uniqueness  results  for  abstract 
Cauchy  problems  of  TTF  form. 

Existence  of  solutions  to  the  inverse  problems  is  a  difficult  issue,  and  is  closely  tied  to  the 
issue  of  compatibility  of  overposed  data  with  the  actual  model  considered.  Of  more  practical 
importance  is  constructibility ,  stability,  and  uniqueness  for  solutions,  given  a  definite  form 
for  the  model.  When  the  model  itself  is  in  question,  existence  is  replaced  by  constructibility 
of  approximations  in  Hadamard’s  triumvirate,  which  historically  has  defined  well-posedness. 
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Introduction. 


Consider  the  initial-boundary  value  problems  on  [0, 1]  x  [0,  T\ , 


ut  -  uxz  =  7 (x,t) 
u(x,0)  =  u0(x) 

«*(0  ,t)  =  f(u(0,t)) 
MM)  =  -f(u(l,t)) 


Ut-uxx  =  7  (x,t)  +  f(u) 
u(x,0)  =  u0(x) 

MM)  =  go(u(0,t)) 
MM)  =  —g\  («(M)) 


The  functions  uo ,  7 ,  g  1  and  <72  axe  assumed  known.  Given  sufficient  smoothness 
on  and  knowledge  of  the  function  /(u),  the  above  direct  problems  have  a  unique 
solution  for  u(x,t).  However,  we  envision  the  situation  where  the  function  /  is 
unknown,  save  that  it  depends  only  on  u,  and  wish  to  determine  both  u(x,t )  and 
/(u)  by  making  additional  boundary  measurements.  We  shall  denote  by  u(x,  t;  f) 
the  solution  of  the  direct  problem  for  a  given  function  / .  For  further  discussion  of 
the  direct  problem  and  the  modeling  of  such  boundary  conditions  see  the  monograph 
[11- 

Equations  (B)  describe,  for  example,  the  diffusion  of  heat  in  a  uniaxial  bar  with 
nonlinear  boundary  conditions  at  the  ends.  Equations  (I)  describe,  for  example, 
the  diffusion  of  heat  in  a  uniaxial  bax  with  nonlinear  source  terms.  Our  prob¬ 
lem  is  therefore  to  determine  the  unknown  temperature  -  dependent  source  term 
(or  flux-temperature  function)  /(u)  from  a  knowledge  of  the  initial  temperature 
distribution,  and  a  measurement  of  the  temperature  u(x,t )  at  one  boundary. 

We  shall  first  consider  the  case  of  recovering  the  pair  (u,  /)  in  (B).  Why  should  one 
be  interested  in  recovering  the  boundary  term  /?  In  many  materials,  the  actual 
radiation  mechanism  is  not  known  perfectly,  and  one  must  use  an  approximation. 
Many  such  approximations  sire  possible.  Perfect  blank  body  radiation  into  space 
(with  zero  ambient  temperature)  is  governed  by  the  law  du/dv  =  <ru4  where  <7  is 
Boltzmann’s  constant,  and  du/dv  denotes  the  derivative  normal  to  the  boundary. 
In  a  linear  model  (Newton’s  law  of  cooling),  one  assumes  that  the  heat  leaves  the 
ends  of  the  bar  with  the  flux  proportional  to  a  temperature  difference,  du/dv  = 
k(u  —  Uqo).  We  propose  a  method  for  obtaining  the  exact  form  of  the  dependence 
of  the  flux  on  temperature  in  the  nonlinear  case. 

If  the  boundary  satisfies  a  known  radiation  law  du/dv  =  h(u)  but  the  incoming  flux 
Q  is  temperature  dependent  and  unknown,  we  have  du/dv  =  Ji(u)  +  Q(u)  =  f(u). 
This  cam  be  viewed  as  a  problem  in  nonlinear  control  theory.  The  target  set  is 
the  desired  temperature  response  {/i(t;)}  and  the  unknown  function  Q(-)  is  the 
control.  We  will  show  that  for  any  target  set  of  monotone  data,  a  piecewise  linear 
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function  /  exists  with  the  property  that  the  solution  u(x ,  t ,  /)  to  (B)  with  f  =  f 
satisfies  u(Q,tj,f)  = 

At  best  we  will  only  be  able  to  recover  the  function  /(u)  for  those  values  taken  on 
by  u(0,  t)  for  0  <  t  <  T.  It  is  thus  necessary  that  the  range  of  values  of  the  function 
u(x,t)  for  x  €  [0,1]  be  contained  in  the  range  of  values  of  u(0,<)  for  0  <  t  <  T. 
In  addition,  we  require  the  overposed  data  to  be  monotone  in  t .  This  monotonicity 
property  is  essential  to  recover  the  function  /  from  a  knowledge  of  f(h).  We  shall 
make  no  attempt  to  completely  characterize  the  class  of  allowable  overposed  data 
h(t).  Instead  we  shall  assume  that  for  some  function  /(u),  the  data  u(0,  f;/)  is 
given,  and  show  that  the  unknown  radiation  function  can  be  reconstructed  from  the 
overposed  data.  We  define  the  residual  of  the  mapping  from  h  t-+  /  as  u(0,  t;  /)  — 
h(t).  Requiring  this  to  vanish  at  a  discrete  set  of  points  tj ,  generates  a  collocation 
scheme. 

In  a  recent  paper  [2],  the  authors  proved  the  existence  of  a  unique  solution  to 
(B)  with  the  overposed  data  u(0,  t)  =  h(t).  The  solution  of  the  inverse  problem 
was  obtained  by  an  iteration  scheme  using  the  boundary  condition  itself  as  the 
update  algorithm.  This  method  can  be  summarized  as  follows.  For  a  given  func¬ 
tion  /,  letU(x,t;  f)  denote  the  solution  to  (B)  except  that  the  overposed  data 
£f(0,t)  =  h(t)  is  used  in  place  of  the  condition  u*(0,i)  —  /(u(0,t)).  If  in  addi¬ 
tion  the  function  U  also  satisfies  this  condition,  then  we  must  have  the  identity 
f(h(t))  =  Ux(0,  t;  f)  =  T[/].  Indeed,  one  can  show  that  under  suitable  condi¬ 
tions,  there  exists  a  unique  fixed  point  of  the  mapping  T,  and  that  this  solves 
the  inverse  problem.  A  similar  approach  can  be  used  for  the  problem  (I).  Eval¬ 
uating  the  differential  equation  ut  —  uxx  —  9  +  /  on  the  line  x  =  0,  we  obtain 
f(m)  =  {A'(l)  -7(0,f)}  -  *„(0,f;/)  s  T[/]. 

Since  we  have  interchanged  the  primary  and  overposed  data  along  one  boundary, 
this  method  for  solving  the  boundary  recovery  problem  (B)  is  limited  to  one  spatial 
dimension.  The  fixed  point  formulation  of  (I)  cam  be  carried  out  in  higher  dimen¬ 
sions.  Both  methods  involve  numerical  differentiation  of  data,  and  hence  require 
the  boundary  measurements  to  be  fairly  noise-free.  Although  the  fixed  point  formu¬ 
lations  have  limitations,  they  are  constructive,  and  offer  the  possibility  of  proving 
existence  and  uiqueness  results  for  the  inverse  problem.  In  practice  they  converge 
very  rapidly  (typically  to  within  a  few  percent  in  less  than  5  iterations), 

How  expensive  are  these  methods?  One  normally  expects  an  inverse  problem  to  be 
more  expensive  than  the  corresponding  direct  problem.  There  are  several  reasons 
for  this  expectation.  First,  although  the  direct  problem  may  be  linear,  the  inverse 
problem  will  often  be  nonlinear.  Second,  inverse  problems  tend  to  be  ill-posed, 
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which  contributes  to  their  difficulty. 


In  the  case  of  the  fixed  point  methods,  one  can  see  that  the  update  algorithm  for 
/„+ 1  in  terms  of  /„  has  minimal  cost,  provided  one  has  obtained  the  solution  of 
the  direct  problem  U(x,t,fn).  Thus,  the  cost  of  the  fixed  point  algorithms  is  the 
number  of  iterations  required  for  acceptable  convergence  multiplied  by  the  cost  of 
solving  the  direct  problem.  As  mentioned  above,  the  actual  cost  of  the  inverse 
problem  is  on  the  order  of  a  few  times  the  cost  of  a  single  direct  solve.  While 
this  is  certainly  acceptable,  could  further  improvements  be  made?  The  collocation 
method  outlined  in  this  paper  offers  just  such  a  possibility.  Since  the  direct  problem 
is  nonlinear,  if  iterative  methods  are  required  for  accuracy  (for  example  Crank- 
Nicolson),  then  the  cost  of  the  inverse  collocation  method  is  comparable  to  the  cost 
of  a  single  nonlinear  direct  solve.  We  have  performed  several  numerical  experiments 
which  indicate  that  the  scheme  is  fairly  robust  under  noisy  data. 


The  Collocation  Method. 

We  shall  assume  that  the  maximum  values  for  u  at  any  time  occurs  on  the  observed 
boundary  and  that  the  boundary  data  is  monotone.  Without  loss  of  generality,  we 
shall  assume  that  it  is  increasing. 

The  collocation  approach  to  the  solution  of  problem  (B)  can  be  described  as  follows. 
We  consider  a  function  /,  piecewise  linear  on  each  of  the  intervals  [tj,  <>+i],  with 
the  corresponding  function  u(x,f;/). 

(a)  Assume  that  f(v)  is  either  known  or  has  been  approximated  by  a  function  f(v) 

for  v  <  h(tj).  On  the  interval  h(tj+i )]  extend  /  using  the  linearized 

approximation  f(v )  =  A ;  (u  —  h(tj))  +  f(h(tj)) ,  for  some  constant  \j  which 
should  be  chosen  in  order  to  satisfy  the  collocation  condition  u(0, tj+i;f)  = 
h(tj+  j).  We  use  a  secant  method  to  calculate  the  slope  A j  from  the  resulting 
nonlinear  equation 

(b)  Step  (a)  is  continued  for  each  interval  [t j ,  tj+ 1  ] . 

(c)  How  does  one  obtain  the  starting  value  f(h( 0))?  If  the  initial  data  and  bound¬ 

ary  data  axe  compatible  at  t  =  0  then  we  have  f(h( 0))  =  Uq(0)  which  deter¬ 
mines  /  initially.  This  will  be  the  case  if  the  process  has  been  evolving  for 
some  positive  time  interval.  If  the  data  are  incompatible,  we  then  use  colloca¬ 
tion  on  to  determine  a  constant  approximation  to  /  on  this  interval. 

This  results  in  an  error  which  is  O(Ah). 
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A  typical  graph  of  fact  and  /  is  pictured  below. 


There  are  some  remarks  to  be  made  on  the  above  procedure. 

(1)  The  number  of  iterations  required  to  obtain  the  slope  A  on  a  given  collocation 
interval  will  obviously  depend  on  the  tolerance  required,  and  to  the  deviation 
of  the  function  faci  from  linearity  on  this  interval,  For  most  of  the  step  sizes 
and  values  of  an  actual  function  / ,  fact  we  chose,  about  3  iterations  sufficed  to 
obtain  A  to  within  the  same  accuracy  as  the  forwards  solution  u(x ,  t;  f ) .  The 
cost  of  the  method  is  therefore  2  or  3  times  the  cost  of  solving  a  linear  direct 
problem,  since  the  approximation  is  by  a  linear  function  on  each  of  the  intervals 
[*>»*>+ 1]- 

(2)  An  alternative  collocation  method  for  this  problem  would  be  to  let  /  depend  on 
N  parameters  {(ci, . . . ,  c^)} ,  and  impose  N  constraints  to  determine  the  cj  ’s 
by  a  least  squares  procedure.  In  our  situation  this  is  not  necessary  because  for 
a  parabolic  equation  the  values  of  u(xo>  )  do  not  affect  the  values  of  u(xo, <2) 
if  ti  >  1 2  •  Thus  the  value  of  c,  does  not  depend  on  the  value  of  Cj+i .  This 
allows  the  coefficients  to  be  solved  sequentially. 

Let  us  consider  the  particular  example  of  problem  (B),  u<  —  uxx  =  0  for  0  <  x  <  1 , 
0  <  t  <  1  with  ux(0 ,t)  =  /(u(0,f)) ,  — u*(l,f)  =  f(u(l,t))  and  u(x,0)  =  10x3(l  — 
x)2 .  We  choose  /act(u)  =  3 it2  —  2u3 .  The  choice  of  initial  data  was  simply  to  ensure 
compatibility  with  the  boundary  data  at  x  =  0  and  x  =  1 . 

The  function  u(x,t;fact)  was  calculated  numerically  and  the  values  of  h(t)  = 
«(M>;/«c<)  at  the  points  tj  =  j/N  for  j  =  0,  1,  ...  ,  N ,  used  as  data  for 
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the  collocation  scheme.  We  used  a  step  size  of  k  =  0.005  in  time  and  h  =  0.04 
in  the  spatial  direction.  The  table  below  shows  the  difference  of  /  and  /  in  the 
supremum  and  L 2  norms  for  various  values  of  N . 


Convergence  rate  of  /  to  fact  • 

N 

Wfact-fW^ 

ll/«c<-/llL2 

1 

■ 

10 

25 

50 

0.0010 

For  the  interior  f(u)  case,  problem  (I),  let  us  consider  the  particular  example  ut  — 
Uxx  =  l  +  f(u)  u(x,0)  =  0,  itx(0,t)  =  t(t  -  2)  ux(l,t)  =  0,  with  /oct(u)  =  ue1-" . 

Again  we  assume  x  6  [0, 1] ,  t  €  [0, 1] .  The  maximum  principle  guarantees  that  the 
maximum  value  of  u(x,  f)  will,  for  each  fixed  t,  occur  at  x  =  0.  If  we  apply  the 
collocation  scheme  outlined  above  to  this  problem  then  we  might  well  obtain  results 


Clearly  something  has  gone  wrong.  The  above  picture  strongly  suggests  that  the 
inverse  problem  as  defined  is  ill-posed  and  stability  can  only  be  recovered  if  we 
can  find  the  correct  dependence  of  the  overposed  data  on  / .  The  fixed  point  for¬ 
mulation,  which  requires  one  to  evaluate  the  differential  operator  on  the  overposed 
boundary  indicates  the  correct  space.  In  this  case  using  equation  (1.8)  we  see  that  7 
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and  h'  should  have  the  same  regularity  as  /  if  a  fixed  point  is  to  be  obtained.  This 
strongly  suggests  that  we  should  expect  that  the  mapping  h  (-*■  /  is  not  bounded 
but  that  h!  t-»  /  might  be.  This  can  in  fact  be  shown,  [3].  Thus  we  should  expect 
to  include  information  on  the  derivative  of  the  overposed  data  in  our  collocation 
scheme. 

Instead  of  setting  the  value  of  u(0,  t,;/)  equal  to  h(tj)  at  each  of  the  collocation 
points  we  can  find  the  linear  function  /  that  minimizes  the  value  of  l«(0 ,*;;/)  - 
h(tj) |  +  a|ut(0, f,;/)  —  h'(tj) |.  In  practice  we  have  found  that  the  value  of  a  has 
little  effect  on  the  convergence  of  the  scheme  provided  it  is  sufficiently  large.  Instead 
of  this  minimization  which  is  more  expensive  than  the  direct  collocation  we  can  take 
the  limiting  of  large  a  which  is  the  same  is  collocation  on  |ut  —  h'\. 

We  obtained  the  results  in  the  table  below  with  this  procedure. 


Convergence  rate  of  /  to  fact. 

N 

||/act  /II  oo 

11/ act  /|| 

5 

10 

20 

3 1 

0.0089 

40 

0.0116 

0.0045 

80 

0.0065 

0.0034 

We  have  been  able  to  prove  the  following  result  for  the  recovery  of  the  boundary 
unknown,  and  we  are  currently  investigating  analogous  results  for  problem  (I). 

Theorem.  (Convergence  Theorem)  Given  monotonicity  of  u  and  sufficent  regu¬ 
larity  of  the  data,  there  exists  a  piecewise  linear  function  f  such  that  the  solu¬ 
tion  u(x,  t]  f)  to  (B)  satisfies  the  collocation  condition  u(0,  ty;  fjv)  =  h(tj)  for 
j  =  0,  1,  N .  Furthermore,  lim  Np\\f  -  /n||„  =  0,  for  p  <  1/2. 

Corollary.  If  the  target  set  of  boundary  measurements,  {h(tj)}  is  monotone,  there 
exists  a  piecewise  linear  boundary  control,  f,  such  that  u(0,  tj;  /)  =  h(tj). 

Besides  the  simplicity  of  this  method,  the  collocation  procedure  offers  advantages 
over  global  recovery  schemes.  Given  any  situation  where  the  overposed  data  depends 
on  the  function  /  in  a  monotone  manner,  the  collocation  method  can  be  carried 
out  in  principle. 
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